darshanr-c/README.md

💫 About Me

Data Enthusiast | Analytics & AI | M.Sc. Data Science

I'm a data science graduate student based in Berlin, passionate about solving real-world problems with data and AI. My work spans:

  • 📊 Data Analytics & Business Intelligence
  • ⚙️ End-to-end Machine Learning Pipelines (Airflow, Docker)
  • 🤖 Large Language Models (LLMs) & Retrieval-Augmented Generation
  • 🧠 Data Engineering & Automation Workflows

I’m fluent in English (C1) and speak intermediate German (B1). Currently open to internships and entry-level roles in Data Analytics and AI.

💻 Tech Stack

Python, R, AWS, Google Cloud, Apache Airflow, MySQL, Postgres, Matplotlib, NumPy, Pandas, Plotly, scikit-learn, TensorFlow, SciPy, Docker, Terraform, Power BI


🌱 Featured Projects

LLM-Powered Match Summarization (Master's Thesis)

  • Developed a cricket match summarization system from commentary using Large Language Models.
  • Fine-tuned LLaMA-2 with LoRA/QLoRA on 400k+ lines of IPL commentary. Implemented factual guardrails and evaluated outputs with ROUGE, F1 score and human review.
  • Achieved >95% factual accuracy, enabling automation of human-like post-match summaries.
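
As a rough illustration of the ROUGE and F1 evaluation mentioned above, here is a minimal sketch of a ROUGE-1 style F1 score based on unigram overlap. It is not the thesis code: the function name `rouge1_f1`, the whitespace tokenisation and the two example sentences are assumptions made up for this example.

```python
from collections import Counter


def rouge1_f1(reference: str, candidate: str) -> float:
    """Unigram-overlap F1 between a reference and a candidate summary."""
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(candidate.lower().split())
    # Clipped overlap: each token counts at most as often as it appears in both texts.
    overlap = sum((ref_counts & cand_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


# Hypothetical sentences, invented for illustration only.
reference = "mumbai won by five wickets"
candidate = "mumbai won the match by five wickets"
score = rouge1_f1(reference, candidate)
print(round(score, 3))
```

A production evaluation would more likely rely on an established ROUGE implementation with stemming and ROUGE-L; the sketch only shows the precision/recall idea behind the metric.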

Marketing ROI & Customer Value Analysis

  • This end‑to‑end case study dives into a marketing dataset to answer business questions such as which channels generate the most profit relative to ad spend, how valuable customers are over time and whether marketing spend is efficient.
  • Instead of stopping at exploration, the project adds simulated business context, computes KPIs (CLTV, ROI, CAC and churn) and segments customers by channel, income and behaviour.
  • It demonstrates business‑focused analytics and data storytelling. Built with Python, pandas, seaborn and matplotlib.
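
The KPIs named above can be sketched with common textbook definitions. The formulas below are assumptions (the project may define CLTV or churn differently), and all function names and input figures are hypothetical.

```python
def roi(revenue: float, spend: float) -> float:
    """Return on investment: profit relative to ad spend."""
    return (revenue - spend) / spend


def cac(spend: float, new_customers: int) -> float:
    """Customer acquisition cost: spend per newly acquired customer."""
    return spend / new_customers


def churn_rate(customers_lost: int, customers_at_start: int) -> float:
    """Share of customers lost during a period."""
    return customers_lost / customers_at_start


def cltv(avg_revenue_per_period: float, gross_margin: float, churn: float) -> float:
    """Simple lifetime value: margin per period divided by the churn rate."""
    return avg_revenue_per_period * gross_margin / churn


# Made-up figures for a single channel.
channel_roi = roi(revenue=1500.0, spend=1000.0)                 # 0.5
channel_cac = cac(spend=1000.0, new_customers=50)               # 20.0
channel_churn = churn_rate(customers_lost=10, customers_at_start=200)  # 0.05
channel_cltv = cltv(avg_revenue_per_period=40.0, gross_margin=0.5, churn=channel_churn)
print(channel_roi, channel_cac, channel_churn, round(channel_cltv, 2))
```

Comparing CLTV against CAC per channel is one common way to judge whether marketing spend is efficient.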

Superstore Sales Analysis

  • Analyses Global Superstore retail data using SQL and Python.
  • The project loads and cleans data in PostgreSQL, extracts actionable insights through SQL queries and visualises the results, then summarises findings for business relevance.
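
A query of the kind described above might look like the following sketch. To keep it self-contained it uses an in-memory SQLite database from the Python standard library as a stand-in for PostgreSQL; the `orders` table, its columns and the rows are hypothetical and not taken from the project.

```python
import sqlite3

# In-memory SQLite stands in for the PostgreSQL database (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (category TEXT, sales REAL, profit REAL)")

# Made-up rows for illustration only.
rows = [
    ("Technology", 1200.0, 300.0),
    ("Furniture", 800.0, -50.0),
    ("Technology", 400.0, 100.0),
    ("Office Supplies", 150.0, 45.0),
]
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)

# Total sales, total profit and profit margin per category, most profitable first.
query = """
SELECT category,
       SUM(sales)               AS total_sales,
       SUM(profit)              AS total_profit,
       SUM(profit) / SUM(sales) AS profit_margin
FROM orders
GROUP BY category
ORDER BY total_profit DESC
"""
summary = conn.execute(query).fetchall()
for category, total_sales, total_profit, margin in summary:
    print(f"{category}: sales={total_sales:.0f} profit={total_profit:.0f} margin={margin:.1%}")
conn.close()
```

The same `GROUP BY` query runs unchanged on PostgreSQL; only the connection code would differ.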

LLM‑Powered Match Summarisation

  • Developed a retrieval‑augmented generation pipeline using Hugging Face, FAISS and LLaMA 2 to summarise cricket matches.
  • Cleaned and aligned 500k+ texts and evaluated outputs for coherence and quality.
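
The retrieval step of such a pipeline can be sketched as follows. This is not the project's implementation: instead of Hugging Face embeddings and a FAISS index, it uses bag-of-words vectors and brute-force cosine similarity from the standard library, purely to show how the passages most relevant to a query are selected before generation. The function names and commentary lines are hypothetical.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy 'embedding': token counts (stand-in for a learned sentence embedding)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, passages: list[str], k: int = 2) -> list[str]:
    """Return the k passages most similar to the query (brute-force search)."""
    q = embed(query)
    ranked = sorted(passages, key=lambda p: cosine(q, embed(p)), reverse=True)
    return ranked[:k]


# Invented commentary lines for illustration only.
passages = [
    "the batter hits a six over long on",
    "the bowler bowls a yorker and takes the wicket",
    "rain delays the start of play",
]
top = retrieve("who takes the wicket", passages, k=1)
print(top)
```

In a real pipeline the retrieved passages would be placed into the prompt for the language model; a vector index replaces the brute-force sort once the corpus grows large.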

Airflow‑Based ML Automation

  • Built and deployed an ML pipeline using Airflow and Docker to automate data ingestion, pre‑processing and model retraining.
  • Reduced manual maintenance by 80%.
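
The task ordering that such a pipeline encodes (ingestion, then pre‑processing, then retraining) can be sketched without Airflow itself. The example below uses `graphlib.TopologicalSorter` from the standard library as a stand-in for an Airflow DAG; the task functions, the shared `ctx` dictionary and the toy "model" (a mean) are all hypothetical.

```python
from graphlib import TopologicalSorter


def ingest(ctx: dict) -> None:
    # Pretend to pull raw records; None marks a missing value.
    ctx["raw"] = [3.0, None, 1.0, 2.0]


def preprocess(ctx: dict) -> None:
    # Drop missing values before training.
    ctx["clean"] = [x for x in ctx["raw"] if x is not None]


def retrain(ctx: dict) -> None:
    # Toy "model": the mean of the cleaned data.
    ctx["model"] = sum(ctx["clean"]) / len(ctx["clean"])


TASKS = {"ingest": ingest, "preprocess": preprocess, "retrain": retrain}
# Each task maps to the set of tasks that must finish before it starts.
DEPENDENCIES = {"ingest": set(), "preprocess": {"ingest"}, "retrain": {"preprocess"}}


def run_pipeline() -> tuple[list[str], dict]:
    """Run every task once, in an order that respects the dependencies."""
    ctx: dict = {}
    order = list(TopologicalSorter(DEPENDENCIES).static_order())
    for name in order:
        TASKS[name](ctx)
    return order, ctx


order, ctx = run_pipeline()
print(order, ctx["model"])
```

An orchestrator such as Airflow adds scheduling, retries and monitoring on top of this dependency graph, which is where the reduction in manual maintenance would come from.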

🤝 Key Strengths

  1. Team‑player & fast learner
  2. Analytical rigour and attention to detail
  3. Curiosity and problem‑solving mindset
  4. Structured organiser

📚 Education

  • Master’s in Data Science – University of Europe for Applied Sciences, Berlin (Sept 2023 – August 2025)
  • B.Sc. Computer Science – University of Pune, India (Jul 2019 – Sept 2022)

Certifications

  1. AWS Cloud Foundations (2024)
  2. BCG GenAI Job Simulation on Forage (2025)
  3. Python for Data Science 101 (2022)
  4. KNIME Analytics Platform L1 (2024)

🌐 Socials

LinkedIn email

Pinned

  1. aaron_recruiter (Public, TypeScript). Forked from jakobap/aaron. BuildwithAI Challenge.

  2. Thesis_Darshan_Chaudhari (Public, Jupyter Notebook)

  3. sql_superstore_analysis (Public, Python)

  4. Marketing_Analysis_ROI (Public, Jupyter Notebook)