I design and build production-ready AI systems β from intelligent API gateways and autonomous agents to real-time data pipelines.
I focus on the layer between raw LLM capabilities and real-world applications: routing, memory, orchestration, and observability β the infrastructure that makes AI systems reliable at scale.
An enterprise-grade intelligent API gateway that semantically routes natural language queries to the right AI backend.
User Query β Semantic Router β casual_chat β LiteLLM (gpt-4o-mini)
β code_assistant β LiteLLM (gpt-4o)
β financial_quant β Celery + LangGraph Agent
Key capabilities:
- π§ Local embedding model routing β BAAI/bge-small-en-v1.5, sub-millisecond classification, zero API cost
- π Enterprise auth β SHA-256 hashed keys, Redis fast path + Postgres fallback,
expires_atenforcement - β‘ Token bucket rate limiting β atomic Redis Lua script, per-user buckets
- π€ Autonomous LangGraph agent β live options data, risk-gated recalculation loop, 3-strategy selection
- π Real-time dashboard β intent distribution, RPM sparkline, routing decision + confidence score
- π Production-ready β Nginx TLS, structured JSON logs, X-Request-ID tracing, Docker Compose full stack
Stack: FastAPI Β· LangGraph Β· Celery Β· Redis Β· PostgreSQL + pgvector Β· LiteLLM Β· Sentence-Transformers Β· Nginx
| Domain | What I Build |
|---|---|
| AI Infrastructure | Semantic routers, agent orchestration, LLM gateways |
| Backend Systems | Async APIs, task queues, real-time data pipelines |
| Security | API key management, rate limiting, auth middleware |
| Observability | Structured logging, distributed tracing, metrics dashboards |
Always building. Always shipping.

