GitHub - Katyayani09/medicare_campaigns_marketmixmodels: This repository explores Market Mix Modeling to analyze the causal impact of marketing strategies on sales performance using statistical analysis and OLS regression. It helps businesses optimize campaign investments by distinguishing causation from correlation.

Market Mix Modeling: Causation and Correlation

Understanding the Impact of Marketing Strategies on Sales Performance

📌 Technologies Used: Python, Pandas, NumPy, Statsmodels, Seaborn, Matplotlib

📜 Project Overview

This project investigates the impact of different marketing strategies on sales performance. Using Market Mix Modeling (MMM), the analysis distinguishes causation from correlation in marketing investments, helping businesses optimize resource allocation.

The study leverages regression analysis, correlation analysis, and exploratory data analysis (EDA) to derive insights from historical campaign data.

📂 Table of Contents

Introduction
Data Preprocessing
Feature Engineering
Exploratory Data Analysis (EDA) & Statistical Analysis
Correlation & Causal Inference - Regression Analysis
Recommendations & Strategies
How to Run

1️⃣ Introduction

Marketing campaigns influence customer acquisition, but which strategies truly impact revenue? This project aims to determine:

Which marketing channels drive the most revenue?
Are campaigns causally linked to sales growth or just correlated?
How do different customer types respond to campaigns?

This project uses OLS Regression to quantify marketing effectiveness.

2️⃣ Data Preprocessing

The dataset contains campaign-related variables such as:

Campaign_Email, Campaign_Flyer, Campaign_Phone
Sales_Contact_1, Sales_Contact_2, ..., Sales_Contact_5
Amount_Collected (Target variable)
Client_Type (Large, Medium, Small, Private Facility)

Preprocessing Steps

✔ Handling missing values
✔ Standardizing column names
✔ Creating time-based features (Month, Year)

3️⃣ Feature Engineering

Time-based aggregation: Calendar_Month, Calendar_Year
Customer segmentation analysis: Grouping by Client_Type
Sales impact analysis: Summarizing key revenue drivers

4️⃣ Exploratory Data Analysis & Statistical Analysis

🔹 Key Insights

Distribution of revenue (Amount_Collected) across client types
Correlation of campaigns with sales revenue
Visualization of marketing effectiveness

📊 Example Visualization

Sales distribution per Client_Type
Time-series trend of Amount_Collected
Correlation heatmap of campaign effectiveness

5️⃣ Causal Inference - Regression Analysis

We performed OLS Regression to measure the impact of marketing campaigns on revenue.

🔹 Model Summary

R-squared = 0.480  # 48% variance explained
F-statistic = 342.1 (p-value = 0.00)  # Model is statistically significant
Durbin-Watson = 0.624  # Suggests autocorrelation

🔹 Significant Predictors (p < 0.05)

Feature	Coefficient	Impact on Revenue
Campaign_Flyer	3.34	🚀 Positive
Sales_Contact_1	4.24	🚀 Positive
Sales_Contact_2	3.64	🚀 Positive
Sales_Contact_3	2.34	🚀 Positive
Sales_Contact_4	10.95	🚀 Strongest Impact

❌ Not significant predictors: Campaign_Email, Campaign_Phone, Sales_Contact_5

6️⃣ Recommendations & Strategies

📌 Actionable Insights:

Increase investment in flyer campaigns & sales contacts (significant positive effect).
Campaign phone calls are ineffective—reallocate budget.
Sales Contact 4 has the highest impact—focus on optimizing this interaction.
Different client types respond differently—customized strategies needed.

7️⃣ How to Run

📥 Installation

Ensure you have the required Python libraries installed:

pip install pandas numpy seaborn statsmodels matplotlib

▶ Run the Notebook

Launch Jupyter Notebook or Google Colab and execute:

!pip install pytimetk -q
import pandas as pd, numpy as np, seaborn as sns, statsmodels.api as sm
import statsmodels.formula.api as smf

📊 Reproduce Results

Run all cells to process the data and generate insights.
Analyze the regression summary and visualizations.

🏆 Final Thoughts

This project provides a data-driven strategy to optimize marketing investments by identifying causal relationships, not just correlations. Future improvements include:

Implementing A/B testing for campaign effectiveness.
Exploring machine learning models (e.g., Random Forest, XGBoost) for better predictions.

📢 Feel free to contribute, raise issues, or suggest improvements! 🚀

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
MarketMixModeling_CausationandCorrelation.ipynb		MarketMixModeling_CausationandCorrelation.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Market Mix Modeling: Causation and Correlation

Understanding the Impact of Marketing Strategies on Sales Performance

📜 Project Overview

📂 Table of Contents

1️⃣ Introduction

2️⃣ Data Preprocessing

Preprocessing Steps

3️⃣ Feature Engineering

4️⃣ Exploratory Data Analysis & Statistical Analysis

🔹 Key Insights

📊 Example Visualization

5️⃣ Causal Inference - Regression Analysis

🔹 Model Summary

🔹 Significant Predictors (p < 0.05)

6️⃣ Recommendations & Strategies

7️⃣ How to Run

📥 Installation

▶ Run the Notebook

📊 Reproduce Results

🏆 Final Thoughts

About

Uh oh!

Releases

Packages

Languages

License

Katyayani09/medicare_campaigns_marketmixmodels

Folders and files

Latest commit

History

Repository files navigation

Market Mix Modeling: Causation and Correlation

Understanding the Impact of Marketing Strategies on Sales Performance

📜 Project Overview

📂 Table of Contents

1️⃣ Introduction

2️⃣ Data Preprocessing

Preprocessing Steps

3️⃣ Feature Engineering

4️⃣ Exploratory Data Analysis & Statistical Analysis

🔹 Key Insights

📊 Example Visualization

5️⃣ Causal Inference - Regression Analysis

🔹 Model Summary

🔹 Significant Predictors (p < 0.05)

6️⃣ Recommendations & Strategies

7️⃣ How to Run

📥 Installation

▶ Run the Notebook

📊 Reproduce Results

🏆 Final Thoughts

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages