Tool used for analysis : SQL
Data Availability
• The data set for the analysis comprises of three tables
- Customer: Customer Demographics
- Transactions: Customer Transaction Details
- Product Category: Product category and sub category information
The following data schema explains relationship of the tables
erDiagram
CUSTOMER {
int customer_id PK
date DOB
string Gender
int city_code
}
TRANSACTIONS {
int transaction_id PK
int cust_id FK
date tran_date
int prod_subcat_code
int prod_cat_code
int Qty
float Rate
float Tax
float total_amt
string Store_type
}
PROD_CAT_INFO {
int prod_cat_code PK
string prod_cat
int prod_sub_cat_code
string prod_subcat
}
CUSTOMER ||--o{ TRANSACTIONS : "makes"
PROD_CAT_INFO ||--o{ TRANSACTIONS : "categorizes"
Approach to the business problem
• DATA PREPARATION AND UNDERSTANDING
- Getting the total number of rows in each of the three tables in the database to understand the tables and records
- Counting the return transactions to Understand the true costs of returns and the potential opportunities to reduce these costs
- Understanding the time range of data for the analysis and many more
• DATA ANALYSIS
- Identifying the most commonly used channel for transactions in order to adopt the best payment strategy for processing payments in stores would provide retailers with new possibilities to capture consumers' hearts.
- Demographic analysis - Counting the female and male customers, as men and women having different shopping preferences different strategies are needed to attract customers in stores
- City wise sales summary to improve the sales strategies as city wise and many more.