Skip to content
View Mohith-akash's full-sized avatar

Block or report Mohith-akash

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Mohith-akash/README.md

👋 Mohith Akash

Analytics Engineer & AI Developer

LinkedIn Email


Architecting Serverless Data Platforms | SQL • Databricks • dbt • Polars • Generative AI

Building production data systems at scale — with $0 infrastructure costs


🚀 Featured Projects

Live Demo Code

Real-time geopolitical news analytics with AI chat & emotion tracking

Feature Details
📊 12M+ events from GDELT + GKG feeds
🧠 GKG Emotions: 2.2K+ dimensions analyzed
🌍 200+ countries, 100+ languages
15-min refresh via Dagster + GitHub Actions
🤖 Dual AI: Text-to-SQL + RAG vector search
🔧 dbt Core: Staging/marts transformation
🚀 Polars: 10x faster than Pandas
💰 $0/month — replaces $1,490+ enterprise tools

Python Polars dbt DuckDB MotherDuck
Dagster Voyage AI LlamaIndex Cerebras Streamlit

Live Demo Code

Databricks Lakehouse with Medallion Architecture

Feature Details
📦 100K+ orders analyzed
🧱 Databricks + Delta Lake lakehouse
🏅 Medallion: Bronze → Silver → Gold
📊 Unity Catalog for data governance
SQL Warehouse for fast queries
🔄 Migrated from dbt + MotherDuck
CI/CD with GitHub Actions

Databricks Delta Lake SQL Python
Streamlit Plotly GitHub Actions


🛠️ Tech Stack

Python SQL Databricks Delta Lake Polars
dbt DuckDB MotherDuck Dagster GitHub Actions
LlamaIndex Voyage AI Cerebras RAG Text-to-SQL
Streamlit Plotly Power BI

📈 What I Build

┌─────────────────────────────────────────────────────────────────────────┐
│  🧱 Lakehouse Architecture →  Databricks, Delta Lake, Medallion        │
│  ⚡ Data Pipelines         →  Polars, Dagster, dbt, real-time ELT      │
│  🦆 Cloud Data Warehouses  →  MotherDuck, DuckDB, serverless SQL       │
│  ✅ Data Quality           →  Great Expectations-style validation      │
│  🤖 AI-Powered Analytics   →  RAG, Text-to-SQL, LLM integration        │
│  🧠 Emotion Analytics      →  GKG processing, 2.2K+ dimensions         │
│  🚀 Vector Search          →  Voyage AI embeddings, cosine similarity  │
│  📊 Interactive Dashboards →  Streamlit, Power BI, Plotly              │
│  🔄 CI/CD Automation       →  GitHub Actions, 15-min refresh cycles    │
└─────────────────────────────────────────────────────────────────────────┘

🎓 Certifications

Google Advanced Data Analytics
Google Cybersecurity
AI for Data Professionals


💼 Open to Opportunities

Data Analyst · AI Engineer · Data Engineer · Analytics Engineer · BI Engineer

Pinned Loading

  1. Global-News-Intel-Platform Global-News-Intel-Platform Public

    AI-powered geopolitical news intelligence platform. Ingests 100K+ daily events from GDELT, stores in MotherDuck (DuckDB), orchestrates with Dagster, and features an AI chat interface with Text-to-S…

    Python 4

  2. olist-analytics-platform olist-analytics-platform Public

    End-to-end analytics platform: CSV → Databricks → Delta Lake → Streamlit Dashboard | 100K+ Brazilian e-commerce orders

    Python

  3. Excel-Data-Analyst-Portfolio-Project Excel-Data-Analyst-Portfolio-Project Public

    An interactive Excel dashboard analyzing 13k+ data analyst job postings