Muhammad Usman Khalil

I'm Muhammad Usman Khalil, an AI Engineer and Full-Stack Developer with 4 years of experience building production AI systems, agents, and automation. I help companies turn AI capabilities into real business outcomes.

About me

I'm an AI engineer with 4 years of experience building production-grade ML and agentic AI systems. My focus is the applied LLM stack: RAG pipelines, LLM fine-tuning for domain-specific tasks, and multi-agent workflows using LangGraph, CrewAI, and the Model Context Protocol (MCP). I've also shipped voice agents and intelligent chatbots that handle real customer conversations end to end.

Day to day, I work with Python, PyTorch, Hugging Face, and LangChain, paired with vector databases like Qdrant and Pinecone for retrieval. I take systems all the way to production: containerizing services with Docker, building CI/CD pipelines, and deploying to AWS and Azure with an eye on cost and latency.

I'm also a full-stack developer (React, Next.js, Node.js), which means I can wrap the models I build into actual products people use instead of handing off a notebook and walking away.

The way I work is consistent and disciplined. I show up every day, ship in small steady increments rather than big unreliable bursts, and I'm always trying to learn the next thing, whether that's a new framework, a new architecture pattern, or a sharper way to think about a problem. Most of what I enjoy is the figuring-out part: taking a vague problem, breaking it down, and getting to a clean solution.

What sets me apart isn't any one tool in the stack. It's that I treat AI work as engineering, not magic. I care about whether the system actually runs in production, whether it stays cheap to operate, and whether the people using it get real value back. Plenty of people can build a demo. Fewer can ship something that holds up.

When I'm not coding, I enjoy playing video games and gardening. I'm also learning history and finance, and picking up how to build businesses on the side.

Projects

Work I'm Proud Of

Real-world projects built with clean code, scalable architecture, and attention to detail.

EVDS Diamond — Full SaaS Platform

Built and deployed a complete SaaS platform for a Spanish diamond-disc manufacturer. Two separate apps on different domains share one API: a customer activation portal where workshops scan a QR code on each disc to activate it within strict 7-day windows, and an internal staff dashboard for label generation, monitoring, and support. Runs in 8 languages and tracks every cut in real time.

  • Next.js
  • Node.js
  • PostgreSQL
  • Docker
  • Nginx
  • Hetzner
  • Let's Encrypt
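The 7-day activation rule described above can be illustrated with a small check. This is a minimal sketch, not the platform's code: the function name `can_activate`, the `window_start` argument, and the assumption that the window opens at a single recorded timestamp (for example when the label is issued) are all hypothetical.

```python
from datetime import datetime, timedelta, timezone

# Assumed window length, taken from the "7-day windows" in the description.
ACTIVATION_WINDOW = timedelta(days=7)


def can_activate(window_start: datetime, now: datetime,
                 window: timedelta = ACTIVATION_WINDOW) -> bool:
    """Return True if `now` falls inside the activation window.

    Both timestamps should be timezone-aware so that scans coming from
    workshops in different countries compare correctly.
    """
    return window_start <= now <= window_start + window


if __name__ == "__main__":
    issued = datetime(2024, 1, 1, tzinfo=timezone.utc)  # hypothetical start
    print(can_activate(issued, issued + timedelta(days=3)))
    print(can_activate(issued, issued + timedelta(days=8)))
```

Keeping the rule in one pure function means the customer portal and the staff dashboard could share it through the common API rather than each re-implementing the date arithmetic.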

Insurance Policy RAG Chatbot

Production RAG system for a US insurance company covering 50+ policy documents. Users ask about a medicine or procedure and the bot returns the exact coverage requirements. Built with parent-child chunking, hybrid search on Pinecone, and CrewAI agents orchestrating GPT-4o. Cut query response time from 2 minutes to 30 seconds by running retrieval calls in parallel threads inside the FastAPI service.

  • Python
  • FastAPI
  • CrewAI
  • OpenAI GPT-4o
  • Pinecone
  • Hybrid Search
  • RAG
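To make the parent-child chunking idea concrete, here is a minimal sketch. It is not the production code: the names `build_index` and `retrieve_parent_context`, the word-based chunk sizes, and the keyword-overlap scoring (a stand-in for the hybrid search on Pinecone) are all assumptions made for illustration. The point it shows is that small child chunks are matched against the query, while the larger parent chunk is what gets passed on as context.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    chunk_id: str
    parent_id: str
    text: str


def split_words(text: str, size: int) -> list[str]:
    """Split text into consecutive pieces of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def build_index(doc_id: str, text: str, parent_size: int = 200,
                child_size: int = 40) -> tuple[dict[str, str], list[Chunk]]:
    """Create large parent chunks and small child chunks that point back to them."""
    parents: dict[str, str] = {}
    children: list[Chunk] = []
    for p_idx, parent_text in enumerate(split_words(text, parent_size)):
        parent_id = f"{doc_id}-p{p_idx}"
        parents[parent_id] = parent_text
        for c_idx, child_text in enumerate(split_words(parent_text, child_size)):
            children.append(Chunk(f"{parent_id}-c{c_idx}", parent_id, child_text))
    return parents, children


def retrieve_parent_context(query: str, parents: dict[str, str],
                            children: list[Chunk], top_k: int = 2) -> list[str]:
    """Score child chunks, then return the de-duplicated parent texts.

    Keyword overlap is used here only as a placeholder for real
    vector or hybrid similarity search.
    """
    query_terms = set(query.lower().split())
    ranked = sorted(
        children,
        key=lambda c: len(query_terms & set(c.text.lower().split())),
        reverse=True,
    )
    parent_ids: list[str] = []
    for child in ranked[:top_k]:
        if child.parent_id not in parent_ids:
            parent_ids.append(child.parent_id)
    return [parents[pid] for pid in parent_ids]
```

Matching on small chunks keeps retrieval precise, and returning the parent gives the model enough surrounding policy text to state the full coverage requirement.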

UBO Compliance Automation (n8n)

Replaced a 7-step manual workflow with a single n8n agent for a KYC compliance team serving global banks. The agent receives the bank email, searches internal records, scrapes Chinese-language sources (Baidu, Aiqicha, GladTrust), compiles a UBO report with AI, updates the database, and emails the result back. Per-lookup time dropped from 30-40 minutes to under 2 minutes at the same accuracy.

  • n8n
  • OpenAI
  • AI Agents
  • Web Scraping
  • Email Automation
  • KYC/AML

AI Chatbot for Finance & Stock Markets

Real-time financial assistant that handles stock and market queries through conversational AI. Built with Agno agents for tool orchestration, yfinance for live market data, Groq for low-latency inference, and DeepSeek as the reasoning model. Wrapped in a polished React frontend with Framer Motion. Handles dynamic queries across stocks, market trends, and financial news with sub-second response times.

  • Agno
  • Groq
  • DeepSeek
  • yfinance
  • React
  • Framer Motion

Plant Disease Detection & Treatment

Final-year project using a CNN to analyse plant leaf images, classify disease type, and recommend treatment. Built on EfficientNet-B0 fine-tuned for plant pathology, served through a FastAPI backend with a React frontend. Treatment suggestions are generated via OpenAI tailored to the detected condition. Tested across multiple plant species and disease categories with strong real-world accuracy on field-captured images.

  • React
  • TensorFlow
  • Keras
  • FastAPI
  • EfficientNet-B0
  • OpenAI

VeriCare — Health Episode Tracker

Patient-facing health tracker built for a US medical practice. Patients log episodes of illness with what happened, when, and how it resolved, and keep a personal medical diary across visits. Unusual design constraint: zero backend. All data lives in the browser's localStorage for full privacy, with no server roundtrip. Built end-to-end in React and deployed on Vercel.

  • React
  • localStorage
  • Tailwind CSS
  • Vercel

Liver Cancer Detection (Research)

Research project on binary classification of hepatocellular carcinoma (HCC) from medical imaging. Benchmarked three modern architectures (EfficientNet-B0, TinyViT, MobileViTv2) against each other on accuracy, model size, and inference speed. The work focuses on edge-case robustness in low-contrast scans and on class imbalance, with a full evaluation pipeline including confusion matrices, ROC curves, and per-class precision metrics.

  • CNN
  • TensorFlow
  • Keras
  • EfficientNet-B0
  • TinyViT
  • MobileViTv2
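The confusion-matrix and per-class precision part of such an evaluation pipeline can be sketched in a few lines. This is an illustrative, dependency-free version under stated assumptions: the function names and the toy labels (0 for non-HCC, 1 for HCC) are hypothetical and are not results from the project.

```python
def confusion_matrix(y_true, y_pred, labels=(0, 1)):
    """Return a matrix where rows are true classes and columns are predicted classes."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for truth, pred in zip(y_true, y_pred):
        matrix[index[truth]][index[pred]] += 1
    return matrix


def per_class_precision(matrix):
    """Precision for class j = correct predictions of j / all predictions of j."""
    precisions = []
    for j in range(len(matrix)):
        predicted_as_j = sum(row[j] for row in matrix)
        precisions.append(matrix[j][j] / predicted_as_j if predicted_as_j else 0.0)
    return precisions


if __name__ == "__main__":
    # Toy, imbalanced labels made up for the example.
    y_true = [0, 0, 0, 0, 1, 1]
    y_pred = [0, 0, 0, 1, 1, 0]
    cm = confusion_matrix(y_true, y_pred)
    print(cm)
    print(per_class_precision(cm))
```

Reporting precision per class rather than a single accuracy figure matters under class imbalance, because a model can score high overall accuracy while performing poorly on the minority class.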

Skills

  • Python
  • PyTorch
  • TensorFlow
  • LangChain
  • LangGraph
  • CrewAI
  • RAG Pipelines
  • LLM Fine-tuning
  • MCP
  • Agentic AI
  • Qdrant
  • Pinecone
  • n8n
  • NLP
  • Voice agents
  • React
  • Next.js
  • FastAPI
  • Django
  • PostgreSQL
  • MongoDB
  • Docker
  • AWS
  • Azure
  • CI/CD
  • Git

Experience

Contact me

Please contact me directly at usman.data002@gmail.com or through this form.