In the telecom industry, preventing customer churn is crucial for long-term success. This data analysis focuses on understanding customer attrition by conducting Exploratory Data Analysis (EDA) using Python. The goal is to uncover patterns and insights that guide informed decision-making.
Initial findings show an overall churn rate of 26.58%, serving as our base target value for future comparisons.
- "Total Charges" column data type corrected to float64 for accurate analysis.
- "Tenure Months" grouped into six bins for improved visualization, creating a new column, "tenure_group."
- "Customer ID" and "Tenure" dropped as irrelevant for focused analysis.
- Converted categorical values to numerical using get_dummies.
- Explored correlations between numerical features and "Churn" column.
- Identified attributes associated with higher churn rates.
- Customers with attributes like month-to-month contracts (42.71%), no online security (41.78%), no tech support (41.65%) exhibit higher churn.
- Customers with less than 1 year of tenure have a churn rate of 47.68%.
- Users who choose electronic check as their payment method show a higher likelihood of churn at 45.29%.
- Month-to-month contract subscribers exhibit a higher churn rate of 42.71%.
- Customers using Fiber Optic (41.89%) instead of DSL (19.00%) are more prone to churn.
- Senior Citizens have a relatively higher churn rate of 41.68%.
- Customers without the following features Online Security (41.78%), Tech Support (41.65%), Online Backup (39.94%), and Device Protection (39.14%) are more likely to churn.
- Paperless Billing Method users have a churn rate of 33.59%, surpassing conventional paper billing at 16.38%.
- Customers without partners (32.98%) are more likely to churn than those with partners (19.72%).
- Customers without dependents (31.28%) are more likely to churn than those with dependents (15.53%).
- Not streaming Movies (33.73%) and not streaming TV (33.54%) contribute to a higher likelihood of churn.
Key Insights: Unveiling Churn Patterns through Bivariate Analysis
- Customers with less than 1 year of tenure (47.68%), especially with fiber optic services (29.61% more) and paperless billing (24.19% more), are more likely to churn.
- Users selecting electronic check payments (45.29%), particularly without tech support (26.67% more) and choosing fiber optic, amplify vulnerability.
- Month-to-month contract subscribers (42.71%), especially with fiber optic services, reveal an increasing churn pattern; females in this group are 2.05% more likely to churn than males.
- Senior Citizens exhibit a relatively higher churn rate of 41.68%, particularly with a contract less than a year and in the absence of a partner.
- Customers without online backup (39.94%) and security (41.78%) play a major role in churn, with a 28.42% higher likelihood compared to users with online security and no online backup.
- Customers without device protection (39.14%), especially with fiber optic services, and without partners (14.43% more) and dependents (16.75% more), are more inclined to churn.
- Customers without partners (32.98%), females in this group are 2.91% more likely to churn than males.
- Customers not streaming movies (33.73%), particularly with fiber optic services (23.38% more) and without partners (14.64% more), are more likely to churn. Additionally, customers not streaming both movies and TV have a 2.65% higher likelihood of churning compared to users streaming TV and not movies.
- Utilizing Machine Learning: Dive into constructing a robust machine learning model to leverage the insights gathered.
- Proactive Retention Strategies: Apply predictive analytics to forecast potential churners, facilitating the implementation of proactive retention strategies.