Data Scientist | Data Enthusiast | MSc in Data Science & Analytics
Welcome to my GitHub! I'm passionate about turning complex data into actionable insights using machine learning, analytics, and effective storytelling. I'm always looking to apply my skills in real-world projects and collaborate with like-minded professionals.
A capstone project for my MSc in Data Science, focused on predicting Customer Lifetime Value using:
- Data Cleaning & Feature Engineering: Processed 500K+ records from the Online Retail II dataset
- RFM Analysis & K-Means Clustering: Created meaningful customer segments based on behaviour
- Predictive Modelling: Trained and evaluated Random Forest and XGBoost models
- Key Outcome: Achieved R² = 0.9864 with Random Forest, outperforming XGBoost in all evaluation metrics
Tools Used: Python, Pandas, NumPy, Scikit-learn, XGBoost, Matplotlib, Seaborn, Jupyter
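For readers new to the workflow, here is a minimal sketch of the RFM, clustering, and regression steps listed above. It is not the project code: the transactions are synthetic (not the Online Retail II dataset), the column names, the helper `build_rfm`, and the choice of four clusters are assumptions made for illustration, and total spend stands in for a proper CLV target. XGBoost is left out to keep the dependencies small.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for transaction data (hypothetical columns).
rng = np.random.default_rng(42)
n_rows = 2000
transactions = pd.DataFrame({
    "customer_id": rng.integers(1, 201, size=n_rows),
    "invoice_date": pd.Timestamp("2011-01-01")
    + pd.to_timedelta(rng.integers(0, 365, size=n_rows), unit="D"),
    "amount": rng.gamma(shape=2.0, scale=25.0, size=n_rows).round(2),
})


def build_rfm(df: pd.DataFrame, snapshot_date: pd.Timestamp) -> pd.DataFrame:
    """Aggregate transactions into one Recency/Frequency/Monetary row per customer."""
    return df.groupby("customer_id").agg(
        # Days between the snapshot date and the customer's last purchase.
        recency=("invoice_date", lambda d: (snapshot_date - d.max()).days),
        # Number of transaction rows (a real dataset might count distinct invoices).
        frequency=("invoice_date", "count"),
        # Total spend over the observation window.
        monetary=("amount", "sum"),
    )


# Snapshot one day after the last transaction, so recency is at least 1.
snapshot = transactions["invoice_date"].max() + pd.Timedelta(days=1)
rfm = build_rfm(transactions, snapshot)

# Scale the RFM features so no single one dominates the distance metric,
# then assign each customer to one of four (assumed) segments.
rfm_features = ["recency", "frequency", "monetary"]
scaled = StandardScaler().fit_transform(rfm[rfm_features])
kmeans = KMeans(n_clusters=4, n_init=10, random_state=42)
rfm["segment"] = kmeans.fit_predict(scaled)

# Placeholder regression: predict total spend from recency and frequency.
X = rfm[["recency", "frequency"]]
y = rfm["monetary"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
model = RandomForestRegressor(n_estimators=200, random_state=42)
model.fit(X_train, y_train)
r2 = r2_score(y_test, model.predict(X_test))

print(rfm.groupby("segment")[rfm_features].mean().round(1))
print(f"Held-out R² on synthetic data: {r2:.3f}")
```

The score printed here says nothing about the capstone result; it only shows where the R² figure comes from in such a pipeline. In a real CLV setup the features and the target would normally come from separate time windows to avoid leakage.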
Explore the repository for code, visuals, and insights.
- MSc in Data Science & Analytics, University of Hertfordshire
- Interests: Customer Analytics, Predictive Modelling, ML Explainability, Data Visualization
- Skills: Python | SQL | Power BI | Machine Learning | Data Wrangling | Exploratory Data Analysis
- Email: [email protected]
Thanks for visiting my profile!