Dedicated and results-driven Data Scientist with a proven track record of leveraging data analytics to drive actionable insights and solve complex business challenges. Proficient in machine learning techniques, data visualization, and statistical analysis. Experienced in collaborating cross-functionally to develop innovative solutions and support data-driven decision-making. Seeking opportunities to contribute my expertise to a dynamic organization committed to harnessing the power of data.
Kapture CX
Bengaluru, India
Developed and implemented a VoiceBot solution powered by Large Language Models (LLM), integrating Automatic Speech Recognition (ASR) and Speech Synthesis to enhance customer engagement and support automation. Winner, Kaphacks 5.0 Hackathon: Built a voice intelligence pipeline for speaker and gender identification from audio, and LoRA fine-tuned Llama-3.1-8B-Instruct to build an in-house LLM trained on conversation transcript data. 2nd Runner-Up, Kaphacks 3.0 Hackathon: Developed SRE GPT, a Google Chat-integrated system that parses Git commit histories and server logs to provide error insights using LLM and RAG.
Kshema General Insurance Ltd
Hyderabad, India
Designed and automated a crop insurance Claim Management System, reducing turnaround time from days to minutes Represented the company at the National Level under the YES-TECH program of PMFBY at TNAU Coimbatore Developed a state-of-the-art object detection model using YOLOv8 architecture to identify crop types from field images
Innoplexus Consulting Services Pvt Ltd
Pune, India
Developed novel solutions for addressing Drug Discovery challenges by harnessing a spectrum of AI models Received the Best Performer award for securing Royalty Partnership for the company through data-driven strategies and predictive modeling with an accuracy over 90%, leveraging LLMs, NLP and Computer Vision techniques Collaborated with cross-functional teams to define and refine project objectives, ensuring alignment with business goals
B.Tech.
Relevant Coursework: DSA, Data Mining & Knowledge Discovery, ML for Signal Processing, Image Processing Awarded a Minor degree in Industrial and Management Engineering
XII
(95.4%)
X
(10.0 CGPA)
Kapture
Developed a lightweight voice activity detection model (SemanticVAD) by fine-tuning Qwen3-0.6B on conversational data for 4 dialogue states (<|start−speaking| >, <|continue−speaking| >, <|start−listening| >, <|continue−listening| >) Achieved 89.1% overall accuracy, with > 90% accuracy on most classes, by leveraging a custom token hidden state pooling method and fine-tuning with a hybrid loss (Cross-Entropy, KL Divergence, Entropy) for robust performance.
Kshema
Developed a pixel classification model for flood mapping using Sentinel-1 satellite data to track flooded regions Identified crop-damaged pixels in agricultural land clusters by analyzing pre- and post-event Sentinel-2 satellite imagery Conducted time series analysis utilizing NDVI values from remote sensing data to determine optimal harvest date for crops Built and deployed the GIS Surveyor application for claim management using Flask in a production environment
Innoplexus
Performed data collection, cleaning and processing, including ETL, aggregation and scraping from multiple databases Trained a GCN (Graph Convolutional Network) model in semi-supervised fashion to predict cancer driver genes in a network Implemented PyTorch Geometric GNN Explainer for model interpretability, providing node and edge importance masks
Innoplexus
Developed a generative AI model using fine-tuned GPT-2 to synthesize novel amino acid sequences with enhanced binding affinities for antigens, accelerating drug development and therapeutic advancements Utilized CNN and Random Forest regression models to predict binding affinities and prioritize antibody sequences Shared 100 optimized sequences; wet lab tests showed 80% above benchmark, 10% below but binding, and 10% not binding