Skip to content
View Mihirmaru22's full-sized avatar

Block or report Mihirmaru22

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Mihirmaru22/README.md

Hi, I'm Mihir 👋

Machine Learning Engineer | ML Systems • Time-Series • Reliability

I build production-oriented ML systems where labels are missing, noise is high, and failures are expensive. My focus is on how ML behaves under real-world constraints, not just model accuracy.


What I Work On

  • Label-free & weakly supervised ML problems
  • Time-series modeling and drift detection
  • Full ML Lifecycle: Preprocessing, Training, Tuning, Validation & Testing
  • Offline learning + deterministic online systems
  • Failure modes, alert fatigue, and explainability

I prefer statistical ML and explicit objectives over opaque black-box models when reliability matters.


Featured Project — BLACKICE ❄️

Hybrid ML System for Streaming Regime Shift Detection

BLACKICE detects persistent behavioral drift in infrastructure metrics using a hybrid ML architecture.

ML Problem (Constraints) ML Approach (Solution)
No labeled data Streaming statistical baselines (Welford)
Highly noisy, bursty signals Offline optimization of decision boundaries
False positives > delays Custom SRE-weighted loss function
Black-box opacity Persistence-aware detection (not point anomalies)

Impact: ~80–90% noise filtered, <1% false positives, O(1) memory, tested on 8GB+ production data.

🔗 Repo: https://github.com/Mihirmaru22/blackice


Other Projects

Local Fire Weather AI 🌲

Real-time forest fire risk assessment API

  • Precision Modeling: Ridge Regression pipeline with automated feature scaling.
  • Production Ready: Serialized Joblib model served via RESTful Flask interface.
  • Deployment: Container-friendly structure for AWS/Render.

🔗 View Code


Technical Skills

Tech Stack

Currently Learning


Open Source

  • Contributor to PyTorch TorchData: Fixed a core batching deadlock (PR #1522) and documented non-reproducibility in ParallelMapper (PR #1523).

Pinned Loading

  1. vaultZero vaultZero Public

    Zero-trust file custody platform where even the platform cannot misuse data — and suspicious access is detected in real time.

    JavaScript

  2. jivanta jivanta Public

    This repository contains a Work-in-Progress (WIP) backend for the Jivanta project, which aims to address the lack of accessible medication in Tier-3 cities and rural areas.

    JavaScript

  3. fire-prediction- fire-prediction- Public

    This system leverages Ridge Regression and a standardized data pipeline to predict the Fire Weather Index (FWI) with high precision. Designed with a modular architecture, it serves predictions via …

    HTML

  4. BLACKICE BLACKICE Public

    BLACKICE is a label-free, streaming anomaly detection system designed for SRE observability. It uses Welford’s algorithm and persistence logic to filter transient noise and identify structural regi…

    Python