AI Engineer
Darshil Kapadia
10 years at IBM India delivering computer vision, NLP, and agentic AI for global enterprises. I design, train, and ship end-to-end AI systems — from custom model fine-tuning to cloud-scale production deployment.
About
I'm an AI Engineer based in India with a decade of experience at IBM, building production AI systems for global enterprise clients — PepsiCo, Bacardi, Dow Chemicals, JSW, FAA, NedBank, Iffco-Tokio, and more. My work covers the full stack: data pipelines, model training and fine-tuning, agentic system design, and cloud deployment on AWS and Azure. I hold an M.Tech from IIT Kharagpur and a B.Tech in Electronics & Communication Engineering.
Generative AI
Decoder-only transformers trained on next-token prediction. State of the art for open-ended generation, reasoning, and instruction following.
Bidirectional transformers that produce rich contextual embeddings. Best suited for classification, NER, and semantic similarity.
Seq2seq architecture mapping input sequences to output sequences. Used for summarisation, translation, and question answering.
Generative models that learn to reverse a noise process. State of the art for high-fidelity image and media synthesis.
ML & Deep Learning
Region-based and anchor-free detectors for localising and classifying objects in images. Applied in insurance claim assessment and industrial inspection.
Hierarchical spatial feature extractors. Backbone of most computer vision pipelines for classification and representation learning.
Recurrent architectures that model temporal and sequential dependencies. Applied in OCR, time-series analysis, and sequence labelling.
Attention-based architecture that unified NLP and vision. Trained via self-supervised objectives before task-specific fine-tuning.
Adversarial generator–discriminator framework for producing realistic synthetic data and high-quality image generation.
Message-passing networks that learn on graph-structured data — knowledge graphs, recommendation systems, and molecular modelling.
Policy optimisation through environment interaction. Underpins LLM alignment (RLHF) and sequential decision-making agents.
Compressing large teacher models into faster, smaller student models while preserving accuracy — critical for production deployment.
Classical algorithms for structured and tabular data. Fast to train, interpretable, and often the right tool before reaching for deep learning.
Supervised
Unsupervised
NLP
Languages
Python Ecosystem
Cloud & Infrastructure
Hands-on experience with core AWS services for data engineering, ML workloads, and application infrastructure.
Deep production experience across the Azure ecosystem — from ML pipelines and vector search to DevOps and real-time compute.
Databases
Projects
No projects yet.
Experience
IBM India
Data Scientist / AI EngineerAug 2016 – Present- ›Built end-to-end computer vision, NLP, and agentic AI solutions for global enterprise clients including PepsiCo, Bacardi, Dow Chemicals, JSW, FAA, NedBank, and Iffco-Tokio.
- ›Fine-tuned large language models using PEFT techniques (LoRA, QLoRA) and aligned models with RLHF; deployed RAG and GraphRAG pipelines in production.
- ›Designed and deployed ML systems on AWS and Azure with Docker, Kubernetes, and full MLOps practices.
IIT Kharagpur
M.Tech — Telecommunication Systems Engineering2014 – 2016Dharmsinh Desai University (DDIT)
B.Tech — Electronics & Communication Engineering2009 – 2013Certifications
View all 17 on Credly →
Databricks Certified Generative AI Engineer Associate
Databricks
May 2026

IBM Generative & Agentic AI Expert — Data Scientist
IBM
Sep 2025
AWS Partner: Agentic AI Essentials
Amazon Web Services
Nov 2025
AWS Partner: Generative AI Essentials
Amazon Web Services
Oct 2025

Deep Learning Specialization
DeepLearning.AI
Sep 2020

Generative AI with Large Language Models
DeepLearning.AI
Jan 2024


