Skip to content

PremKumar135/PremKumar

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PREM KUMAR.K

Senior Data Scientist | Experience: 5+ yrs

REACH ME:

Email: [email protected]
Contact: 8838787234
LinkedIn: https://www.linkedin.com/in/prem-kumar-870820133

PROFESSIONAL EXPERIENCE:

  1. Quantiphi Analytics Pvt Ltd , Mar,2021 - Present
  2. Kryptos Technologies Pvt Ltd , Nov,2019 - Nov,2020
  3. Tata Consultancy Services , Jan,2017 - Mar, 2019

EDUCATION:

  • B.Tech in Mechanical Engineering, Pondicherry University, 2012-16, CGPA: 9.01/10

SKILLS:

  1. Languages : Python.
  2. Frameworks : Flask, Scikit-Learn, Keras, Tensorflow, river.
  3. Tools : Git, SQL, Sagemaker, Docker, Airflow, MongoDB, DVC, Mlflow, ElasticSearch.
  4. Cloud : AWS & its services.
  5. Technologies : Machine Learning, Deep Learning, Computer Vision, NLP.
  6. Major Libraries : Numpy, Pandas, Matplotlib, Seaborn, cv2, NLTK, Librosa, BeautifulSoup.

PROJECT 1: CHORUS DETECTION


This project is a music retrieval system that detects instances of chorus sections in the music recording. Generally, the chorus section in the audio represents the most repetitive part of the song.

chorus detection
Photo by Kelly Sikkema on Unsplash

Tools : AWS EC2, Python, Docker, Flask.
Technology : Machine Learning, Audio Processing, Image Processing.
Algorithm : DBSCAN.

RESPONSIBILITIES:
  • Given audio, convert the audio into a time-time similarity matrix and a time-lag similarity matrix using chorma features.
  • Smoothing and binarizing the similarity matrix using image processing techniques.
  • Finding the clusters in the processed image that represent the similar parts in the audio.
  • Find the matching chorus, trim the found choruses from the audio, and split the audio into a vocal component and an accompaniment component.
  • Finding the presence of a voice using spectral energy
  • I packaged the model into a Docker container, and it is published in the AWS MarketPlace.

PROJECT 2: SIMILAR IMAGES RETRIEVAL


This project is a similar image retrieval system that, given a query image, retrieves similar images from the database of images.

image_similarity
Photo by Dietmar Ludmann on Unsplash

Tools : AWS EC2, python, Elastic Search, AWS Sagemaker.
Technology : Machine Learning, Deep Learning, Image Processing.
Algorithm : K-Nearest Neighbor, ResNet.

RESPONSIBILITIES:
  • The task here is, given a query image, to retrieve all the similar images in the database.
  • We used ResNet to find the encoding of all the available images in the dataset.
  • Uploaded all the image feature encodings in the AWS ElasticSearch with unique ids for each image.
  • Given a query image, find the encoding using the ResNet model and use elastic search KNN to find the k nearest neighbours.

PROJECT 3: BAGS CLASSIFIER


Focused on an image classification model that classifies the different types of bags from a single luxury brand in order to automate the manual process.

bags_classifier
Photo by Diana Akhmetianova on Unsplash

Tools : AWS EC2, python, AWS Sagemaker, AWS Rekognition.
Technology : Deep Learning, Image Processing.
Algorithm : ResNet.

RESPONSIBILITIES:
  • The task here is to build an image classification model that classifies the given image.
  • Implemented different techniques to counter the imbalanced dataset as the data was imbalanced.
  • Have used AWS Rekognition and the AWS Sagemaker built-in image classification algorithm model to train the model and got the evaluation metrics for both of the models.
  • Explain to the client about these two models, their corresponding metrics, and the cost requirements of using them, and suggest which is the least expensive.
  • Deployed the Sagemaker model using the Sagemaker endpoint

PROJECT 4: MLOPS PLATFORM


Worked in an agile environment to build a MLOps platform where we built and tested an entire ML lifecycle using Sagemaker BYOC, pipelines using Airflow for different ML use cases, and helped organizations migrate their use cases to our platform.

MLOPS
Photo by Glenn Carstens-Peters on Unsplash

Tools : Python, Sagemaker Pipelines, Sagemaker Projects, Vertex AI(GCP), Kubeflow, Airflow, Docker.
Technology : Machine Learning, Deep Learning.

Responsibility:
  • Created a components for each ML lifecycle like processing, training, explainability, and monitoring using Sagemaker BYOC and Docker.
  • Created a different pipelines for different tasks like training, retraining, inference, creating endpoints, and monitoring using Airflow.
  • Built a sample projects with each use case, like tabular, computer vision, and NLP to test the entire ML lifecycle using the platform we have created.
  • Documented the user manual for do's and dont's of components, pipelines, and their inner workings and how to create them on their own for their own use cases.
  • Helped organizations migrate their projects into our platform and gave them deep dive sessions on how to migrate them on their own.

PROJECT 5: PACING RATE PREDICTION


This project focused on predicting the pacing rate, which helps to determine the number of outbound calls in a call center in a bank, in order to optimize the work force in the campaign.

pacing_rate
Photo by Austin Distel on Unsplash

Tools : Python, MongoDB.
Technology : Machine Learning, Time Series.

RESPONSIBILITIES:
  • The task here is to predict the pacing rate in order to automate the manual change of the pacing rate in the call center application of the bank. 
  • The pacing rate determines how many outbound calls to dial at a particular time.
  • Did explanatory data analysis and time series analysis to find out the call acceptance pattern, time taken by the caller to accept, and other features.
  • Built a formula that can predict the pacing rate and created a script that can dynamically predict the pacing rate for a particular time using those features.
  • Deployed the script in production and used cron to schedule it.

PROJECT 6: EMOTION DETECTION MODEL


This project focused on an image classification model that classifies the different types of emotions of a human while playing the dart game in the entertainment center.

emotion_detection
Photo by Tengyart on Unsplash

Tools : AWS EC2, Python, Flask.
Technology : Deep Learning, Image Processing.
Algorithm : Xception Network.

RESPONSIBILITIES:
  • The task here is to predict the emotions of the players playing the dart game in the entertainment center.
  • Collected the images using the camera used to shoot the players while playing the game.
  • Used different augmentation techniques to create more data for the training.
  • Tried different face detection models to find out the best face detection system since the camera is nearly 2 meters away from the user.
  • Trained an Xception model to classify human emotions in the particular frame of the video.
  • Used tflite to reduce the model size to deploy the model on the on-prem devices.
  • Built a Flask API to predict emotion in the given image and sent the image for the next process of making the highlights video of the good moments during the play.

PROJECT 7: AGE & GENDER DETECTION


This project focuses on two image classification models, where one model detects the gender and the other detects the age of the user for verification during the KYC process in the bank.

age_gender_det
Photo by Nathan Dumlao on Unsplash

Tools : Python.
Technology : Deep Learning, Image Processing.
Algorithm : MobileNet for Face Detection, ResNet for Gender Detection.

RESPONSIBILITIES:
  • The task here is to predict the gender and age of the user to verify during the KYC process.
  • Collected dataset from various resources for both the application and the client as well.
  • Implemented different techniques to counter the imbalanced dataset as the data was imbalanced.
  • Used multi-threading techniques to run both the model and deployed the model in the production on-prem devices in the bank.

PROJECT 8: STENCIL NUMBER EXTRACTION


This project focused on extracting the stencil numbers from the tires of an automobile.

stencil_number
Photo by Nazim Zafri on Unsplash

Tools : Python, AWS EC2.
Technology : Machine Learning, Image Processing, Deep Learning.
Algorithm : OCR.

RESPONSIBILITIES:
  • The task here is to extract the stencil number from the given image, which contains tires.
  • Tried different image processing techniques like smoothing, binarizing, edge detection, skew correction, dewaring, etc. to extract the words out of it.
  • Tried to use AWS Textract and AWS rekognition to extract the words out of the image.
  • Tried OCR techniques like EasyOCR and Pytesseract to extract the text out of the images.

PROJECT 9: EXTRACT & VERIFY SIGNATURES


This project focused on extracting and verifying the signatures of a user in the bank.

signature_verify
Photo by Signature Pro on Unsplash

Tools : Python.
Technology : Machine Learning, Deep Learning, Image Processing.
Algorithm : Yolov3, Siamese Network.

RESPONSIBILITIES:
  • The task here is to extract the signature and verify the signature with the original signature.
  • Trained Yolov3 to extract the signatures from the given image
  • Trained a siamese network model to verify whether both images are similar or not.

PROJECT 10: PREDICTION OF BEST TOOL


This project focused on predicting the best tool for the manufacturing process under given operating conditions.

Best_tool
Photo by Greg Rosenke on Unsplash

Tools : Python.
Technology : Machine Learning, Natural Language Processing.
Algorithm : Logistic Regression.

RESPONSIBILITIES:
  • The task here is to predict the best tool for the manufacturing process under certain operating conditions.
  • Given the text document, extract a feature using regex and natural language processing and save it as a CSV.
  • Tried different machine learning algorithms to figure out which model performed better for our metric.
  • Deployed the model into production on on-prem devices.

PROJECT 11: Forecasting of Resource Utilization


This project focused on forecasting resource utilization for the telecom industry.

Tools : Python, AWS Forecast.
Technology : Machine Learning, Time Series Forecasting.

RESPONSIBILITIES:
  • The task here is to forecast the resource utilization for the given time interval.
  • Given the csv files that contain data on the utilization of the past. 
  • Tried different preprocessing techniques using Pandas and saved it the way AWS Forecasting demands it.
  • Build different models using AWS Forecasting to build forecasting models and compare the results.

PROJECT 12: Migration of a machine learning model from GCP to AWS environment


This project focused on the migration of ML models from the GCP environment to the AWS environment.

Tools : Python, AWS Sagemaker, Docker.
Technology : Machine Learning.

RESPONSIBILITIES:
  • The task here is to migrate the machine learning model that was deployed in the GCP environment to the AWS environment.
  • Used Sagemaker BYOC concept to migrate the model from GCP to AWS. 
  • Sagemaker BYOC expects a Docker container that contains the environment for the model to be deployed, so we created a Dockerfile that does that.
  • Once the Docker image has been created, the Sagemaker endpoint has been created using the Docker image and the model from the GCP environment.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published