About
A self-taught data scientist and machine learning engineer with a background in Computer Science and Engineering and over 2 years of experience in the AI/ML industry. My areas of interest include Applied Machine and Deep Learning, Generative AI, Reinforcement Learning and MLOps. At Genpact, I work with the AI/ML Practice team to develop cutting-edge solutions leveraging Generative AI. I also build custom fine-tuned Vision, Language and Multimodal solutions for a variety of use cases in Healthcare, Manufacturing, Finance and Retail. As part of R&D, I work with the core team to publish our research at top conferences and patent our solutions.
Work Experience
Skills
Check out my latest work
I've worked on a variety of projects, from fine-tuned domain chatbots to agentic RAG applications. Here are a few of my favorites.

Qwen Mental Health Chatbot System
Fine-tuned the Qwen 3 4B Thinking model on an open-source mental health dataset from Hugging Face. Built a FastAPI application that wraps the fine-tuned model with a performant inference server and memory to persist user conversations. Added patient, case and user handling to create an end-to-end mental health chatbot application.
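A minimal sketch of what the serving layer could look like, assuming the fine-tuned checkpoint sits at a local placeholder path and using a plain in-memory dict for per-user conversation memory; the endpoint shape, model path and generation settings are illustrative, not the production setup.

```python
# Hedged sketch: FastAPI wrapper around a fine-tuned Qwen checkpoint with simple per-user memory.
# The checkpoint path and endpoint are placeholders, not the real application's configuration.
from collections import defaultdict

from fastapi import FastAPI
from pydantic import BaseModel
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_PATH = "./qwen3-4b-mental-health"  # placeholder path for the fine-tuned checkpoint
tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
model = AutoModelForCausalLM.from_pretrained(MODEL_PATH, torch_dtype="auto", device_map="auto")

app = FastAPI()
conversations: dict[str, list[dict]] = defaultdict(list)  # user_id -> chat history


class ChatRequest(BaseModel):
    user_id: str
    message: str


@app.post("/chat")
def chat(req: ChatRequest) -> dict:
    history = conversations[req.user_id]
    history.append({"role": "user", "content": req.message})
    # Build the prompt from the persisted history and generate a reply.
    inputs = tokenizer.apply_chat_template(
        history, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output = model.generate(inputs, max_new_tokens=512)
    reply = tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True)
    history.append({"role": "assistant", "content": reply})
    return {"reply": reply}
```

In a real deployment, a dedicated inference server (for example vLLM) and a persistent database would replace the in-process model and the in-memory dict.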

Godot RAG Assistant
Developed an agentic RAG application that acts as a Godot Documentation QnA assistant, helping Godot developers quickly learn and debug their games. It draws on the Godot docs, a Hugging Face Godot QnA dataset and other internet sources in real time to answer queries. Built a simple UI using Chainlit; currently building a full UI with NextJS.
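A rough sketch of the Chainlit-plus-retriever wiring such an assistant could use; the FAISS index name, embedding model and local Ollama model are assumptions, and the agentic routing over live internet sources is omitted here.

```python
# Hedged sketch: a Chainlit front end over a prebuilt Godot-docs retriever.
# Index path, embedding model and Ollama model name are illustrative assumptions.
import chainlit as cl
import ollama
from langchain_community.vectorstores import FAISS
from langchain_huggingface import HuggingFaceEmbeddings

embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
# Assumes the Godot docs and QnA dataset were embedded offline into this local index.
vectorstore = FAISS.load_local("godot_docs_index", embeddings, allow_dangerous_deserialization=True)
retriever = vectorstore.as_retriever(search_kwargs={"k": 4})


@cl.on_message
async def on_message(message: cl.Message):
    docs = retriever.invoke(message.content)  # fetch relevant documentation chunks
    context = "\n\n".join(d.page_content for d in docs)
    response = ollama.chat(
        model="llama3.1",  # placeholder local model
        messages=[
            {"role": "system", "content": f"Answer Godot questions using this context:\n{context}"},
            {"role": "user", "content": message.content},
        ],
    )
    await cl.Message(content=response["message"]["content"]).send()
```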
Verified Skills
Fully Automated MLOps
Datacamp - Sep 2024
MLOps Concepts
Datacamp - Sep 2024
MLOps Deployment and Life Cycling
Datacamp - Sep 2024
Data Scientist with Python
Datacamp - Jun 2022
Data Manipulation with Python
Datacamp - Jul 2022
Deep Learning with PyTorch: Zero to GANs
Jovian - Jul 2023
Machine Learning with Python: Zero to GBMs
Jovian - Mar 2023
Google Cloud Training - Cloud Engineering Fundamentals, Cloud Application Development, Cloud ML-AI
Google Cloud - Apr 2021
Java Programming: Problem Solving with Software
Coursera - Jun 2020
Articles and Papers
Building a Multi-Vector Chatbot with LangChain, Milvus, and Cohere
In the fast-growing area of digital healthcare, medical chatbots are becoming an important tool for improving patient care and providing quick, reliable information. This article explains how to build a medical chatbot that uses multiple vectorstores. It focuses on creating a chatbot that can understand medical reports uploaded by users and give answers based on the information in these reports. Additionally, this chatbot uses another vectorstore filled with conversations between doctors and patients about different medical issues. This approach allows the chatbot to have a wide range of medical knowledge and patient interaction examples, helping it give personalized and relevant answers to user questions. The goal of this article is to offer developers and healthcare professionals a clear guide on how to develop a medical chatbot that can be a helpful resource for patients looking for information and advice based on their own health reports and concerns.
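A compact sketch of how the two-vectorstore setup described above could be wired with LangChain, Milvus and Cohere; the collection names, Milvus URI and prompt wording are assumptions made for illustration.

```python
# Hedged sketch: two Milvus vector stores (patient reports + doctor-patient conversations)
# queried together and passed to Cohere's Command-R model. Names are illustrative only.
from langchain.retrievers import EnsembleRetriever
from langchain_cohere import ChatCohere, CohereEmbeddings
from langchain_community.vectorstores import Milvus

embeddings = CohereEmbeddings(model="embed-english-v3.0")
connection = {"uri": "http://localhost:19530"}  # local Milvus instance

reports_store = Milvus(embedding_function=embeddings, collection_name="patient_reports",
                       connection_args=connection)
dialogues_store = Milvus(embedding_function=embeddings, collection_name="doctor_patient_chats",
                         connection_args=connection)

# Pull context from both vector stores for every question.
retriever = EnsembleRetriever(
    retrievers=[reports_store.as_retriever(), dialogues_store.as_retriever()],
    weights=[0.5, 0.5],
)
llm = ChatCohere(model="command-r")


def answer(question: str) -> str:
    docs = retriever.invoke(question)
    context = "\n\n".join(d.page_content for d in docs)
    prompt = f"Use the context to answer the medical question.\n\nContext:\n{context}\n\nQuestion: {question}"
    return llm.invoke(prompt).content
```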
Self Hosting RAG Applications On Edge Devices with Langchain and Ollama–Part II
In the second part of our series on building a RAG application on a Raspberry Pi, we expand on the foundation laid in the first part, where we created and tested the core pipeline. Now we take things a step further by building a FastAPI application to serve the RAG pipeline and a Reflex app to give users a simple, interactive way to access it. This part walks through setting up the FastAPI back-end, designing the front-end with Reflex, and getting everything up and running on the Raspberry Pi. By the end, you'll have a complete, working application that's ready for real-world use.
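A minimal sketch of the split described here: a FastAPI endpoint serving the pipeline and a Reflex state that calls it over HTTP. The Ollama model acting as a stand-in for the full retriever-plus-LLM chain from Part I, the route name and the port are all assumptions; in practice the back-end and front-end live in separate apps.

```python
# Hedged sketch: FastAPI back-end serving the RAG pipeline, plus the shape of a Reflex
# state that calls it. In a real project these would be two separate modules.
import httpx
import reflex as rx
from fastapi import FastAPI
from langchain_community.llms import Ollama
from pydantic import BaseModel

# Stand-in for the full retriever + LLM chain built in Part I.
rag_chain = Ollama(model="llama3")

app = FastAPI()


class Query(BaseModel):
    question: str


@app.post("/ask")
def ask(query: Query) -> dict:
    return {"answer": rag_chain.invoke(query.question)}


# Reflex front-end: an event handler posts the question to the FastAPI back-end.
class ChatState(rx.State):
    question: str = ""
    answer: str = ""

    async def ask(self):
        async with httpx.AsyncClient() as client:
            resp = await client.post("http://localhost:8000/ask",
                                     json={"question": self.question})
            self.answer = resp.json()["answer"]
```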
Self Hosting RAG Applications On Edge Devices with Langchain and Ollama – Part I
This article shows how to transform this small device into a capable tool for smart document processing. We'll guide you through setting up the Raspberry Pi, installing the needed software, and building a system to handle document ingestion and QnA tasks. By the end, you'll see how even the smallest tech gadgets can achieve impressive results with a bit of creativity and effort.
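A short sketch of the ingestion-and-QnA loop this kind of setup uses, with Ollama running locally on the Pi; the file name, chunk sizes and model names are illustrative choices, not the article's exact configuration.

```python
# Hedged sketch: document ingestion and QnA with Ollama on a Raspberry Pi.
# File path, chunking parameters and model names are illustrative.
from langchain_community.document_loaders import PyPDFLoader
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.llms import Ollama
from langchain_community.vectorstores import Chroma
from langchain_text_splitters import RecursiveCharacterTextSplitter

# Ingest: load a PDF, chunk it, and embed the chunks into a local Chroma store.
docs = PyPDFLoader("manual.pdf").load()
chunks = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50).split_documents(docs)
vectorstore = Chroma.from_documents(chunks, OllamaEmbeddings(model="nomic-embed-text"),
                                    persist_directory="./chroma_db")

# QnA: retrieve relevant chunks and let a small local model answer from them.
llm = Ollama(model="phi3")  # a small model that fits the Pi's memory
retriever = vectorstore.as_retriever(search_kwargs={"k": 3})


def ask(question: str) -> str:
    context = "\n\n".join(d.page_content for d in retriever.invoke(question))
    return llm.invoke(f"Answer using only this context:\n{context}\n\nQuestion: {question}")
```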
RAG Application using Cohere Command-R and Rerank – Part 2
In the previous article, we experimented with Cohere's Command-R and Rerank models to generate responses and re-rank document sources, implementing a simple RAG pipeline to answer users' questions about ingested documents. However, that implementation is too bare-bones for a general user, as it has no user interface for interacting with the chatbot directly. In this article, we will modularize the codebase for easier interpretation and scaling, and build a Streamlit application that serves as a chatbot interface to the RAG pipeline. We will also add a memory component so users can ask follow-up questions about previous responses.
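A rough sketch of the Streamlit chat interface with session-state memory; the `rag_pipeline.answer()` import is a hypothetical stand-in for the modularized Command-R plus Rerank pipeline, not an actual module from the article.

```python
# Hedged sketch: Streamlit chat UI with session-state memory over a RAG pipeline.
import streamlit as st

from rag_pipeline import answer  # hypothetical module holding the modularized pipeline

st.title("Chat with your documents")

# st.session_state keeps the conversation across reruns; this is the memory component.
if "messages" not in st.session_state:
    st.session_state.messages = []

for msg in st.session_state.messages:
    with st.chat_message(msg["role"]):
        st.write(msg["content"])

if question := st.chat_input("Ask a question about the ingested documents"):
    st.session_state.messages.append({"role": "user", "content": question})
    with st.chat_message("user"):
        st.write(question)
    # Pass the history so follow-up questions can refer to earlier answers.
    reply = answer(question, history=st.session_state.messages)
    st.session_state.messages.append({"role": "assistant", "content": reply})
    with st.chat_message("assistant"):
        st.write(reply)
```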
RAG Application with Cohere Command-R and Rerank – Part 1
The Retrieval-Augmented Generation approach combines LLMs with a retrieval system to improve response quality, but inaccurate retrieval can still lead to sub-optimal responses. Cohere's Rerank model enhances this process by evaluating and ordering search results based on contextual relevance, improving accuracy and saving time for anyone hunting for specific information. This article provides a guide to implementing Cohere's re-ranker for document re-ranking, comparing retrieval with and without it. It uses a pipeline to demonstrate both scenarios, providing insights into how the re-ranker can streamline information retrieval and improve search tasks.
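A small sketch of the re-ranking step itself using Cohere's Python SDK; the query and passages are toy examples, and in the full pipeline the documents would come from the vector-store retrieval stage.

```python
# Hedged sketch: re-ranking retrieved passages with Cohere's Rerank endpoint before
# passing the top ones to the LLM. Query and documents below are toy examples.
import os

import cohere

co = cohere.Client(os.environ["COHERE_API_KEY"])

query = "What are the side effects of the prescribed medication?"
retrieved_passages = [
    "The medication should be stored at room temperature.",
    "Common side effects include drowsiness, nausea, and headache.",
    "The clinic is open Monday through Friday.",
]

# Rerank scores each passage against the query and returns them ordered by relevance.
results = co.rerank(
    model="rerank-english-v3.0",
    query=query,
    documents=retrieved_passages,
    top_n=2,
)
for r in results.results:
    print(f"{r.relevance_score:.3f}  {retrieved_passages[r.index]}")
```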
A Beginner’s Guide to Evaluating RAG Pipelines Using RAGAS
In the ever-evolving landscape of machine learning and artificial intelligence, the development of language model applications, particularly Retrieval Augmented Generation (RAG) systems, is becoming increasingly sophisticated. However, the real challenge surfaces not during the initial creation but in the ongoing maintenance and enhancement of these applications. This is where RAGAS—an evaluation library dedicated to providing metrics for RAG pipelines—comes into play. This article will explore the RAGAS library and teach you how to use it to evaluate RAG pipelines.
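A minimal sketch of what a RAGAS evaluation call looks like; the single sample record is made up purely to show the expected columns, and exact column names and the evaluate interface vary slightly across RAGAS versions.

```python
# Hedged sketch: scoring RAG outputs with RAGAS. The sample is fabricated to show the
# expected dataset columns (question, answer, contexts, ground_truth).
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import answer_relevancy, context_precision, context_recall, faithfulness

samples = {
    "question": ["What does RAG stand for?"],
    "answer": ["RAG stands for Retrieval Augmented Generation."],
    "contexts": [["Retrieval Augmented Generation (RAG) combines retrieval with generation."]],
    "ground_truth": ["Retrieval Augmented Generation."],
}

dataset = Dataset.from_dict(samples)
scores = evaluate(
    dataset,
    metrics=[faithfulness, answer_relevancy, context_precision, context_recall],
)
print(scores)  # per-metric scores for the pipeline's responses
```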
Get in Touch
Want to chat? Just shoot me a DM with a direct question on Twitter and I'll respond whenever I can. I will ignore all soliciting.