If you are here, it's probably because you were redirected from my Twitter, my talks, or my CV.
First and foremost, my love for computer science started in middle school, when I coded my first animation in Flash. Though I am a generalist, my speciality is taking a PoC to an MVP and an MVP to a production-ready system. I work on OpenML and on multiple commercial EU projects such as Stairway2AI and AIonDemand, where I am involved in the architecture design, schema, deployment and development of ML systems on these platforms. I have worked on the standardization of datasets, models and evaluations. I developed the OpenML BOOST platform, an education platform for biomedical students taking an ML course. On OpenML, I worked on designing the backend and cloud infrastructure. At Kramphub, I am working on building a machine learning platform and creating data science workflows with the business.
A few engineering tools I use day to day:
- Kubernetes: For cluster management, networking, Docker and object storage.
- Python: My go-to language for APIs, automation and data engineering.
- React: Learnt it involuntarily; I can't say I'm a master of it, but I can work my way through JS.
- DevOps/MLOps: It's a very fast-evolving field, but I am trying to keep up with the latest news.
Currently I work as a data scientist at Microsoft. My job and my hobbies both entail looking at data and metadata to build tools; data can be used to tell stories and make better decisions. I am a huge advocate for open data and responsible data science, which is another reason I loved working @OpenML. I used to work at TolaData, doing data science for monitoring and analytics for different NGOs. Personally, I am looking for more opportunities to contribute to data science for social good and climate change.
I worked as a researcher @TU/e on multiple research projects. I like solving real-world problems such as machine learning on dirty datasets, imbalanced datasets, explainable machine learning and human-in-the-loop AI. I also enjoy research topics like meta-learning, dataset similarity, metric learning, AutoML and neural architecture search. If you would like to collaborate on any of these topics, feel free to contact me. My publications can be found on my Scholar profile.
I write blog posts and give talks from time to time. These days you can find some of my work on the OpenML blog. I love talking about open data, reproducibility, ML standardization and ML education.
I am used to being part of communities, and I help build them wherever I can. I used to be a part of PyData Delhi, and I started PyData Tartu and PyData Heidelberg as well. Currently I am working on building the OpenML community. I am also part of NumFOCUS on multiple DISC projects.
I have given a few workshops and taught the Machine Learning Engineering course at TU Eindhoven. I used to speak often at PyCon conferences before corona.
Thank you for reading through this profile. I love collaborations, and I am especially interested if you have a project that contributes directly and positively to society or the environment.
- Indicator library: Django API for indicator search
- Tola Reports: Dashboards for M&E enterprises
- AutoAI: AutoML API for financial data
- OpenML website: Flask API for OpenML backend
- Dataset 2.0: Next generation dataset formats for future datasets
- Flow 2.0: Next generation model storage for new ML APIs
- OpenML K8s: K8s integration in OpenML powered by distributed storage from MinIO
- BOOST Education: Education platform for ML students
- Online AutoML: AutoML for data streams
- AutoBalance: AutoML for imbalanced data
- StairwAI: Benchmarking platform for EU ML assets
- AIonDemand: AI asset Marketplace
- LOTUS: AutoML for unsupervised tasks
- Demand Forecasting for Kramphub warehousing
- KRAMP-AI: AI chatbot for Kramphub internal documents
- AutoML: Added AutoML capabilities for teams across the organisation
- Transferability estimation: Implemented transferability estimation in a surgical setting and for medical image classification, and built a sensible benchmark for it