This project focuses on analyzing customer behavior, retention, and acquisition trends in an e-commerce dataset. It employs techniques like RFM analysis, cohort analysis, segmentation, and visualization to uncover actionable insights, helping businesses improve their customer lifetime value (CLV) and optimize retention strategies.
- Overview
- Technologies Used
- Project Type
- Data Description
- Solution Approach
- Key Data Insights
- Business Recommendations
- Python (Pandas, NumPy, Matplotlib, Seaborn, lifetimes)
- Jupyter Notebook & Google Colab
- Data Preprocessing/Cleaning and Feature Engineering
- Advanced Visualization Techniques, RFM Analysis, CLV Segmentation, and Cohort Analysis
Column Name | Description |
---|---|
InvoiceNo | Invoice number that consists of 6 digits. If this code starts with the letter 'C', it indicates a cancellation. |
StockCode | Product code that consists of 5 digits. |
Description | Product name. |
Quantity | The quantities of each product per transaction. |
InvoiceDate | This represents the day and time when each transaction was generated. |
UnitPrice | Product price per unit. |
CustomerID | Customer number that consists of 5 digits. Each customer has a unique customer ID. |
Country | Name of the country where each customer resides. |
This project follows a structured data analysis workflow:
- Data Cleaning & Preprocessing: Handling missing values, outliers, and data transformations.
- Exploratory Data Analysis (EDA): Visualizing customer behavior, revenue trends, and segmentation insights.
- RFM Analysis: Categorizing customers based on Recency, Frequency, and Monetary values.
- Cohort Analysis: Analyzing retention trends over time.
- Customer Segmentation:
- Identified 1,306 loyal customers contributing significantly to monthly revenue.
- At-risk customers represent a key area for retention strategies.
- 📊 Customer Lifecycle Analysis:
- Retention rates declined sharply after the second cohort month, highlighting the need for stronger engagement post-onboarding.
- 💰 Revenue Contribution:
- Loyal customers contribute over 75% of the revenue, emphasizing the value of retention efforts.
- 🛍️ Purchasing Behavior:
- Weekdays observed a higher purchasing frequency, suggesting optimal times for targeted campaigns.
- 🔄 Retention Strategies:
- Develop loyalty programs and personalized offers for at-risk customers.
- Automate re-engagement campaigns using email or app notifications.
- 📢 Acquisition Optimization:
- Invest in digital ads targeting high-growth regions like California.
- Run location-based promotional campaigns to tap underperforming regions.
- ⭐ Customer Experience Enhancement:
- Analyze feedback from at-risk customers to address concerns and improve satisfaction.
- Offer exclusive benefits for customers in their first two months to boost retention.
- 📊 Data-Driven Campaigns:
- Leverage RFM analysis to run targeted promotions for high-value segments.
- Use geographic insights to tailor regional marketing campaigns.