
I'm Muhammad Usman Khalil, an AI Engineer and Full-Stack Developer with 4 years of experience building production AI systems, agents, and automation. I help companies turn AI capabilities into real business outcomes.
About me
I'm an AI engineer with 4 years of experience building production-grade ML and agentic AI systems. My focus is the applied LLM stack: RAG pipelines, LLM fine-tuning for domain-specific tasks, and multi-agent workflows using LangGraph, CrewAI, and the Model Context Protocol (MCP). I've also shipped voice agents and intelligent chatbots that handle real customer conversations end to end.
Day to day, I work with Python, PyTorch, Hugging Face, and LangChain, paired with vector databases like Qdrant and Pinecone for retrieval. I take systems all the way to production: containerizing services with Docker, building CI/CD pipelines, and deploying to AWS and Azure with an eye on cost and latency.
I'm also a full-stack developer (React, Next.js, Node.js), which means I can wrap the models I build into actual products people use instead of handing off a notebook and walking away.
The way I work is consistent and disciplined. I show up every day, ship in small steady increments rather than big unreliable bursts, and I'm always trying to learn the next thing, whether that's a new framework, a new architecture pattern, or a sharper way to think about a problem. Most of what I enjoy is the figuring-out part: taking a vague problem, breaking it down, and getting to a clean solution.
What sets me apart isn't any one tool in the stack. It's that I treat AI work as engineering, not magic. I care about whether the system actually runs in production, whether it stays cheap to operate, and whether the people using it get real value back. Plenty of people can build a demo. Fewer can ship something that holds up.
When I'm not coding, I enjoy playing video games and gardening. I'm also learning history and finance, and picking up how to build businesses on the side.
Projects
Real-world projects built with clean code, scalable architecture, and attention to detail.
EVDS Diamond — Full SaaS Platform
Built and deployed a complete SaaS platform for a Spanish diamond-disc manufacturer. Two separate apps on different domains share one API: a customer activation portal where workshops scan a QR code on each disc to activate it within strict 7-day windows, and an internal staff dashboard for label generation, monitoring, and support. Runs in 8 languages and tracks every cut in real time.
- Next.js
- Node.js
- PostgreSQL
- Docker
- Nginx
- Hetzner
- Let's Encrypt



Insurance Policy RAG Chatbot
Production RAG system for a US insurance company covering 50+ policy documents. Users ask about a medicine or procedure and the bot returns exact coverage requirements. Built with parent-child chunking, hybrid search on Pinecone, and CrewAI agents orchestrating GPT-4o. Cut query response time from 2 minutes to 30 seconds by parallelising retrieval with FastAPI multi-threading.
- Python
- FastAPI
- CrewAI
- OpenAI GPT-4o
- Pinecone
- Hybrid Search
- RAG



UBO Compliance Automation (n8n)
Replaced a 7-step manual workflow with a single n8n agent for a KYC compliance team serving global banks. The agent receives the bank email, searches internal records, scrapes Chinese-language sources (Baidu, Aiqicha, GladTrust), compiles a UBO report with AI, updates the database, and emails the result back. Per-lookup time dropped from 30-40 minutes to under 2 minutes at the same accuracy.
- n8n
- OpenAI
- AI Agents
- Web Scraping
- Email Automation
- KYC/AML



AI Chatbot for Financial & Stock Market
Real-time financial assistant that handles stock and market queries through conversational AI. Built with Agno agents for tool orchestration, yfinance for live market data, Groq for low-latency inference, and DeepSeek as the reasoning model. Wrapped in a polished React frontend with Framer Motion. Handles dynamic queries across stocks, market trends, and financial news with sub-second response times.
- Agno
- Groq
- DeepSeek
- yfinance
- React
- Framer Motion



Plant Disease Detection & Treatment
Final-year project using a CNN to analyse plant leaf images, classify disease type, and recommend treatment. Built on EfficientNet-B0 fine-tuned for plant pathology, served through a FastAPI backend with a React frontend. Treatment suggestions are generated via OpenAI tailored to the detected condition. Tested across multiple plant species and disease categories with strong real-world accuracy on field-captured images.
- React
- TensorFlow
- Keras
- FastAPI
- EfficientNet-B0
- OpenAI



VeriCare — Health Episode Tracker
Patient-facing health tracker built for a US doctor practice. Patients log episodes of illness with what happened, when, and how it resolved, and keep a personal medical diary across visits. Unusual design constraint: zero backend. All data lives in browser localStorage for full privacy, with no server roundtrip. Built end-to-end in React and deployed on Vercel.
- React
- LocalStorage
- Tailwind CSS
- Vercel



Liver Cancer Detection (Research)
Research project on binary classification of Hepatocellular Carcinoma from medical imaging. Benchmarked three modern architectures (EfficientNet-B0, TinyViT, MobileViTv2) against each other for accuracy, model size, and inference speed. Focus on edge-case robustness in low-contrast scans and class imbalance, with a full evaluation pipeline including confusion matrices, ROC curves, and per-class precision metrics.
- CNN
- TensorFlow
- Keras
- EfficientNet-B0
- TinyViT
- MobileViTv2



Skills
Experience
Python / ML Developer
Lahore, Punjab
Building AI automation with CrewAI and FastAPI, integrating RAG pipelines into multi-agent LangGraph workflows. Optimizing async execution for faster inference, deploying via Azure and Docker.
Jul 2025 - PresentAI Engineer & Full-Stack Developer
Freelance
Building production AI systems for international clients: RAG pipelines, fine-tuned LLMs, agentic workflows with LangGraph and CrewAI, plus full-stack delivery with React, Next.js, and cloud deployment.
Jan 2023 - PresentWeb Development Intern
Sargodha, Punjab
PHP full-stack development on real-world client projects. Built and maintained web applications using HTML, CSS, and backend workflows alongside a senior development team.
May 2024 - Jul 2024Contact me
Please contact me directly at usman.data002@gmail.com or through this form