Motogear International, a premier supplier of high-performance bikes and accessories, has been at the forefront of innovation and customer satisfaction. With a diverse product range spanning mountain bikes, road bikes, essential bike accessories, components and clothing, the company has expanded its market presence across π¦πΊ Australia πΊπΈ USA π¨π¦ Canada π©πͺ Germany π¬π§ UK and π«π· France
π Project Type: SQL Analysis | MySQL | Data Analytics
π Objective: Perform exploratory and advanced data analysis to extract insights from customer, product, and sales data, helping to drive business decisions.
Here are some key performance indicators derived from the dataset:
- π Total Sales: $29,356,250
- π¦ Total Quantity Sold: 60,423 units
- π° Average Price per Unit: $486
- π Total Orders Processed: 27,659
- π·οΈ Total Products Available: 295
- π₯ Total Customers: 18,484
- Database: MySQL
- Querying & Analysis: SQL
- Version Control: Git & GitHub
The dataset represents Motogear Internationalβs customer base, product catalog, and sales transactions from 2010-12-29 to 2014-01-28, with products sold across six countries. It consists of three key tables:
Stores customer details, including demographics and account creation data.
| Column Name | Data Type | Description |
|---|---|---|
customer_key |
Integer | Unique identifier for each customer (Primary Key). |
customer_id |
Integer | Internal customer ID. |
customer_number |
String | Unique customer number used for tracking. |
first_name |
String | Customer's first name. |
last_name |
String | Customer's last name. |
country |
String | Customer's country of residence. |
marital_status |
String | Marital status (Single/Married). |
gender |
String | Gender (Male/Female). |
birthdate |
Date | Customerβs date of birth. |
create_date |
Date | Date of account creation. |
Contains details about the products sold, including pricing and categorization.
| Column Name | Data Type | Description |
|---|---|---|
product_key |
Integer | Unique identifier for each product (Primary Key). |
product_id |
Integer | Internal product ID. |
product_number |
String | Unique product number for tracking. |
product_name |
String | Name of the product. |
category_id |
String | Product category ID. |
category |
String | Product category (Bikes, Components, Accessories, Clothing). |
subcategory |
String | Subcategory of the product (e.g., Mountain Bikes, Road Frames). |
maintenance |
String | Indicates if the product requires maintenance (Yes/No). |
cost |
Integer | Cost of the product. |
product_line |
String | Product line (e.g., Road, Mountain). |
start_date |
Date | Date when the product was introduced. |
Records sales transactions, linking customers and products.
| Column Name | Data Type | Description |
|---|---|---|
order_number |
Integer | Sales transaction identifier (Primary Key). |
customer_key |
Integer | Foreign Key linking to dim_customers. |
product_key |
Integer | Foreign Key linking to dim_products. |
order_date |
Date | Date when the order was placed. |
quantity |
Integer | Number of units sold. |
price |
Integer | Price per unit of the product. |
sales_amount |
Integer | Total revenue generated (quantity * unit_price). |
shipping_date |
Date | Date when the order was shipped. |
- Dimension Tables (
dim_*): Contain descriptive attributes about customers and products. - Fact Table (
fact_sales): Stores transactional data related to customer purchases. - Primary Keys (
*_key): Uniquely identify rows in dimension tables. - Foreign Keys (
customer_key,product_key): Establish relationships between tables.
- Load the data into a SQL database (MySQL, PostgreSQL, or SQLite).
- Perform exploratory data analysis (EDA) using SQL queries.
- Use Python (Pandas, Matplotlib, Seaborn) to visualize trends.
The project is divided into three major phases:
Objective: Understand the structure and characteristics of the dataset.
β
Basic Database Exploration β Identifying missing values, duplicates, and inconsistencies.
β
Dimension Exploration β Analyzing customer and product attributes.
β
Date Exploration β Examining sales trends over time.
β
Measure Exploration β Assessing key metrics like total revenue and order count.
β
Magnitude Analysis β Identifying high-value customers and top-selling products.
β
Ranking Analysis β Ranking customers, products, and sales regions based on performance.
Objective: Derive deeper insights into customer behavior and sales trends.
β
Changes Over Time Analysis β Examining year-over-year growth and seasonality.
β
Cumulative Analysis β Tracking cumulative revenue and customer acquisition.
β
Performance Analysis β Measuring sales efficiency and customer retention.
β
Part-to-Whole Analysis β Understanding contributions of different products and regions.
β
Data Segmentation β Categorizing customers into segments based on spending patterns.
Objective: Summarize findings in a structured format for business decision-making.
Purpose: Consolidates key customer metrics and behaviors.
β Key Features:
- 1οΈβ£ Demographics & Transaction Details β Names, ages, and spending patterns.
- 2οΈβ£ Customer Segmentation β Classification into VIP, Regular, and New customers.
- 3οΈβ£ Aggregated Customer Metrics:
- Total Orders β Number of purchases made.
- Total Sales β Overall revenue contribution.
- Total Quantity Purchased β Units bought over time.
- Total Products Purchased β Diversity in purchases.
- Lifespan (Months) β Duration of customer activity.
- 4οΈβ£ Key Performance Indicators (KPIs):
- Recency
- Average Order Value (AOV)
- Average Monthly Spend
Purpose: Consolidates key product metrics and behaviors.
β Key Features:
- 1οΈβ£ Product Spicifications β Name, category, subcategory, and cost.
- 2οΈβ£ Product Segmentation β Classification into High-Performers, Mid-Range, or Low-Performers.-
- 3οΈβ£ Aggregated Customer Metrics:
- Total Orders β Number of purchases made.
- Total Sales β Overall revenue contribution.
- Total Quantity Sold β Units bought over time.
- Lifespan (Months) β Duration of product activity.
- 4οΈβ£ Key Performance Indicators (KPIs):
- Recency
- Average Order Revenue (AOR)
- Average Monthly Revenue
β Insight: New customers generate the highest revenue (37.81%), indicating strong acquisition efforts, while VIPs (36.59%) provide sustained value, and Regular customers (25.59%) may need engagement strategies to boost spending.
β Insight: Sales experienced a significant rise from 2010 to 2013, peaking in 2013 with over 16 million in total sales.
β Insight: Bikes dominate total sales, contributing 96.46% of revenue, while Accessories and Clothing combined account for less than 4%. This suggests that the business is heavily reliant on bike sales, and there may be opportunities to diversify revenue streams by improving the sales strategy for Accessories and Clothing.
π Customer Retention Strategies π
πΉ Target VIP customers with exclusive offers
πΉ Engage New customers with onboarding discounts
πΉ Improve Regular customer frequency through loyalty programs
π Product Optimization ποΈ
πΉ Focus on Bikes & Accessories for revenue maximization
πΉ Reduce inventory for Low-Performing items
π Market Expansion π
πΉ Highest sales in πΊπΈ USA β expand marketing efforts
πΉ Emerging markets in π¦πΊ Australia need localized campaigns
β
More Advanced Analytics: Machine learning predictions for future sales.
β
More Visuals: Interactive PowerBI dashboards.
π€ Reet Chandra
π§ reetphy@gmail.com
π LinkedIn