- Python
- Machine Learning
- Data Analysis/Visualisation
- Deep Learning
- NLP
- SQL
- Django3
With over 4+ years of experience, I excel as a highly skilled Data Scientist specializing in AI engineering, particularly in Generative AI, NLP, and machine learning. I have a demonstrated history of deploying sophisticated ML models in production settings and exhibit adeptness in both Django Rest Framework (DRF) and Flask REST APIs.
I have expertise in the following skills, for more information look at my resume
April 2022 - October 2023
Accomplished efficient data retrieval from a community platform by hitting 30 APIs daily using CRON Jobs. Employed Python and Pandas for data transformation and successfully stored the processed data in BigQuery. Utilized NLP techniques to analyze user comments and determine sentiment, enhancing insights for further decision-making.
Successfully deployed fine-tuned LLM models (Llama2 and Mistral) using Hugging Face, Flask API, and Docker on AWS EC2. Achieved a remarkable 40% cost reduction and 30% latency reduction by employing quantized versions of the models.
Advanced utilization of Hugging Face Transformers and open-source NLP/NLG modules to architect sophisti- cated language models and innovative applications.
Leveraging PySpark, created data pipeline using AWS GLUE, processed semi-structured data from Marketo APIs, seamlessly integrating it into PostgreSQL database which reduced manual workload by 20% monthly.
Tech: Python, Machine Learning, GCP, AWS GLUE, S3, EC2, Lambda, LLM, HuggingFace, NLP, Data Pipelines, PySpark, Generative AI.
April 2021 - Present
Designed and developed RESTful APIs using Django Rest Framework, seamlessly integrating and deploying deep learning endpoints to enable advanced computer vision tasks.
Optimized database performance with Redis caching, reducing query response times by 40%, and implemented Celery for distributed processing, which increased in system scalability.
Implemented Nginx as a reverse proxy and load balancer for DRF APIs, significantly boosting the platform’s capacity to handle 30% more concurrent requests.
Utilized Docker for streamlined application deployment and effectively managed AWS EC2 instances, achieving cost savings through efficient resource utilization.
Tech: Python, Deep Learning, nginx, docker, S3, EC2, celery, redis, MySql, ORM, Selenium.
January 2021 - April 2021
Designed and developed Image Scrapper - Created a Flask API utilizing Selenium to scrape images from a designated website, subsequently storing them in S3. Managed the storage of all associated metadata in an SQLite database.
August 2020 - present
Currently working on Neural Style transfer techniques
February 2020
Won 2 nd place in NLP hackathon conducted by Grey Atom. Certificate
December 2019
Won 1 st place in machine learning hackathon conducted by GreyAtom. Certificate
July 2019 - November 2020
I learned here very deep intuition about mathematics of machine learning algorithms, which gave me extra boost in my confidence for building deep and complex models. Deep learning models are not a blackbox to me anymore since I can interpret each and every layer . Had a very competitive and friendly environment.
July 2016 - June 2019
My whole journey of computer science started from here. After learning data structures and algorithms I became very much comfortable in various languages such as C++ and C. Later I learned Python and that was the starting of my machine learning journey :)
May 2014 - June 2015
Completed my high school with science stream, Later I decided to go for computer science because I was always passionate to learn how computers actually works at low level
These Portfolio will speak for my skills. Feel free to explore some of my best portfolios. You can find all of my portfolios here :)
I can do these task very comfortably. Feel free to reach me out if you want any of these services and more..
I can build Machine learnning models from scratch as well as using high level APIs such as scikit learn. I can build regression models, classification models such as linear regression, logistic regression, KNN, SVM, CART, Xgboost, naive bayes with hyperparameter tuning without doing any data leakage.
I know from classical techniques such as bow, tfidf to advance techniques such as Word2Vector Sent2Vector, Transformers, BERT, DistilBERT. I can use regex for preprocessing the text data while optimizing the code. I can use deep learning techniques to train models with pre defined GLOVE vectors via Embedding layers.
I can build DNN, RNN, LSTM, GRU, CNN and R-CNN. I can use either tensorflow or high level APIs such as keras for builing such models. I can build models in both sequential and functional way using keras. I can customize callbacks and accuracy metric for models to get a better and variety of results.
I can work with both AWS and GCP. I know AWS lambda, EC2, S3 ,ELB. I can configure a compute engine in both AWS and GCP.
I can deploy machine learning models in cloud using flask framework.
I have been learning machine learning and deep learning from past 3 years and I can teach about these in great detail. Thats why I write blogs and loving it.
I'm currently available for work. If you have a project you'd like to discuss, then please email me via the form below.
biswasshubendu4@gmail.com
shubendu.github.io