Specializing in building scalable AI systems, advanced Machine Learning models, RAG pipelines, and agentic workflows. I bridge the gap between research and production to deliver high-impact solutions. Machine Learning, Generative AI, Deep Learning, Data Science.
(And yes, this website is created by AI with a very deep blend of my prompt engineering and software engineering skills 🤖)

Professional Profile

I am an AI Engineer with a strong foundation in Python, Machine Learning, and Generative AI. I specialize in developing production-ready AI solutions, including RAG systems, autonomous agents, and fine-tuned LLMs. My expertise extends to end-to-end application development, allowing me to build robust applications that integrate complex AI logic with user-friendly interfaces.
My focus is on delivering measurable business value through automation and optimization. Whether it's reducing operational costs with intelligent workflows or enhancing user engagement with context-aware chatbots, I approach every project with a problem-solving mindset and a commitment to engineering excellence.
I thrive in collaborative environments and am adept at translating technical concepts for diverse stakeholders. I am continuously expanding my skill set to stay at the forefront of AI innovation.
Started with fine-tuning and RAG techniques for specialized domains.
Deepened expertise in ML algorithms and data science methodologies.
Developed end-to-end AI applications with Python backends and modern frontends.
Deployed scalable LLM solutions using vLLM and Ollama for production environments.
AI Engineering & Development
Building production-ready AI systems with proven impact
Complex issue resolution & analysis
Leading & mentoring dev teams
Strategic decision making
Technical & stakeholder alignment
Development Journey

With over 3500+ organic downloads on PyPI, CodeFabric is an AI-powered Python package that automates codebase generation with 85% accuracy, covering 95% of the development lifecycle — all via a user-friendly CLI.

A fine-tuned 1B parameter model trained on 10M+ tokens of psychologist conversational data for empathetic, contextually relevant mental health support. It comes with ollama compaitable GGUF format for easy access on any device with CPU.

Self-hosted AI chatbot platform that turns your documents into intelligent conversational assistants using RAG. Build unlimited chatbots with customizable personalities and extensive RBAC.

AI-powered EDA agent built with CrewAI and programmable Jupyter Notebook tools, allowing LLMs to directly control notebooks for data analysis pipelines.

End-to-end on-device fall detection system using mobile inertial sensors and Deep Learning. Features multi-task TFLite models, real-time inference (<20ms), and robust false-positive control.

Run small open-source LLMs fully on-device. Privacy-first, offline inference app using Flutter and Cactus runtime.

JusticeAI: India's first AI-powered legal advisor, built on authentic Indian law books and data, leveraging RAG for accurate, context-aware legal guidance.

Built an AI-powered platform to document legacy applications (COBOL, RPG) and accelerate modernization into modern architectures, reducing planning and documentation time by up to 90%.

Deployed and integrated LLaMA 3 and LLaMA 4 models using Ollama and vLLM for scalable inference. Enhanced performance and reduced latency with advanced KV caching.

Designed a hybrid ML-AI system for predicting job titles and salaries using EDA, feature engineering, Random Forest, and RAG-based generative models on 1.5M+ records, achieving up to 95% accuracy.
Contact Me
I'm always open to discussing new projects, creative ideas, or opportunities to be part of your visions. Feel free to reach out through any of these channels!
Looking for a passionate AI Engineer? Let's build something amazing together! 🚀