pjwan2/README.md

Hi, I'm PJ 👋

Building enterprise-grade AI infrastructure and agentic systems

LinkedIn GitHub


About Me

I design and build production-ready AI systems, from intelligent API gateways and autonomous agents to real-time data pipelines.

I focus on the layer between raw LLM capabilities and real-world applications: routing, memory, orchestration, and observability. This is the infrastructure that makes AI systems reliable at scale.


Featured Project

An enterprise-grade intelligent API gateway that semantically routes natural language queries to the right AI backend.

User Query → Semantic Router → casual_chat     → LiteLLM (gpt-4o-mini)
                             → code_assistant  → LiteLLM (gpt-4o)
                             → financial_quant → Celery + LangGraph Agent
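
As a rough illustration of the routing diagram above, here is a minimal sketch of nearest-intent routing by cosine similarity. It is not the project's actual code: the `ROUTES` table, `route` function, and the toy 3-dimensional vectors are assumptions made for this example. In a real deployment the vectors would come from an embedding model such as BAAI/bge-small-en-v1.5.

```python
import math

# Hypothetical route table mirroring the diagram: intent name -> backend label.
ROUTES = {
    "casual_chat": "LiteLLM (gpt-4o-mini)",
    "code_assistant": "LiteLLM (gpt-4o)",
    "financial_quant": "Celery + LangGraph Agent",
}

# Toy vectors standing in for real intent embeddings (assumption for the demo).
INTENT_VECS = {
    "casual_chat": [1.0, 0.0, 0.0],
    "code_assistant": [0.0, 1.0, 0.0],
    "financial_quant": [0.0, 0.0, 1.0],
}


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def route(query_vec, intent_vecs=INTENT_VECS):
    """Return (intent, backend, confidence) for the most similar intent vector."""
    intent, score = max(
        ((name, cosine(query_vec, vec)) for name, vec in intent_vecs.items()),
        key=lambda pair: pair[1],
    )
    return intent, ROUTES[intent], score


if __name__ == "__main__":
    print(route([0.1, 0.9, 0.2]))
```

The similarity score doubles as a confidence value, which is one plausible way to feed the "routing decision + confidence score" shown on a dashboard.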

Key capabilities:

  • 🧠 Local embedding model routing: BAAI/bge-small-en-v1.5, sub-millisecond classification, zero API cost
  • 🔐 Enterprise auth: SHA-256 hashed keys, Redis fast path + Postgres fallback, expires_at enforcement
  • ⚡ Token bucket rate limiting: atomic Redis Lua script, per-user buckets
  • 🤖 Autonomous LangGraph agent: live options data, risk-gated recalculation loop, 3-strategy selection
  • 📊 Real-time dashboard: intent distribution, RPM sparkline, routing decision + confidence score
  • 🔒 Production-ready: Nginx TLS, structured JSON logs, X-Request-ID tracing, Docker Compose full stack
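
To make the token bucket bullet concrete, here is a minimal in-memory sketch of per-user token buckets. It is an illustration only: the gateway described above runs this logic atomically in a Redis Lua script, whereas this example keeps state in a plain dict, and the names `TokenBucket` and `allow_request` plus the default capacity and refill rate are assumptions.

```python
import time


class TokenBucket:
    """Bucket that refills continuously at `refill_per_sec`, up to `capacity`."""

    def __init__(self, capacity, refill_per_sec, now=None):
        self.capacity = float(capacity)
        self.refill_per_sec = float(refill_per_sec)
        self.tokens = float(capacity)  # start full
        self.updated_at = time.monotonic() if now is None else now

    def allow(self, cost=1.0, now=None):
        """Refill based on elapsed time, then spend `cost` tokens if available."""
        now = time.monotonic() if now is None else now
        elapsed = max(0.0, now - self.updated_at)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_sec)
        self.updated_at = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


# Hypothetical per-user registry; a shared store such as Redis would replace this.
_buckets = {}


def allow_request(user_id, capacity=5, refill_per_sec=1.0, now=None):
    """Look up (or create) the caller's bucket and try to spend one token."""
    if user_id not in _buckets:
        _buckets[user_id] = TokenBucket(capacity, refill_per_sec, now=now)
    return _buckets[user_id].allow(now=now)
```

The optional `now` argument exists so the behavior can be checked deterministically. A single-process dict is not safe across multiple workers, which is the reason to move the refill-and-spend step into one atomic server-side script.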

Stack: FastAPI · LangGraph · Celery · Redis · PostgreSQL + pgvector · LiteLLM · Sentence-Transformers · Nginx
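
The auth capability listed above (SHA-256 hashed keys, a fast cache path with a database fallback, and expires_at enforcement) can be sketched roughly as follows. This is a hedged illustration, not the project's implementation: two plain dicts stand in for Redis and Postgres, and `hash_key`, `issue_key`, and `authenticate` are hypothetical names.

```python
import hashlib
import secrets
from datetime import datetime, timedelta, timezone


def hash_key(raw_key):
    """Only the SHA-256 digest is stored, never the raw key."""
    return hashlib.sha256(raw_key.encode("utf-8")).hexdigest()


def issue_key(db, user_id, ttl):
    """Create a random key, persist its digest with an expiry, return the raw key once."""
    raw_key = secrets.token_urlsafe(32)
    db[hash_key(raw_key)] = {
        "user_id": user_id,
        "expires_at": datetime.now(timezone.utc) + ttl,
    }
    return raw_key


def authenticate(cache, db, raw_key, now=None):
    """Return the user id for a valid, unexpired key, else None.

    `cache` plays the role of the fast path and `db` the fallback;
    both are dicts here (assumption for the demo).
    """
    now = now or datetime.now(timezone.utc)
    digest = hash_key(raw_key)
    record = cache.get(digest)
    if record is None:
        record = db.get(digest)
        if record is None:
            return None
        cache[digest] = record  # warm the fast path for the next request
    if record["expires_at"] <= now:
        return None
    return record["user_id"]


if __name__ == "__main__":
    cache, db = {}, {}
    key = issue_key(db, "user-1", timedelta(hours=1))
    print(authenticate(cache, db, key))
```

Checking `expires_at` after the cache lookup means an expired key is rejected even when its record is still cached.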


Tech Stack

Python · FastAPI · Redis · PostgreSQL · Docker · Nginx · GitHub Actions


Areas of Focus

| Domain | What I Build |
|---|---|
| AI Infrastructure | Semantic routers, agent orchestration, LLM gateways |
| Backend Systems | Async APIs, task queues, real-time data pipelines |
| Security | API key management, rate limiting, auth middleware |
| Observability | Structured logging, distributed tracing, metrics dashboards |

Always building. Always shipping.

Popular repositories

  1. streaming-rag-pipeline Public

    Python 1

  2. scribe-java Public

    Forked from scribejava/scribejava

    Simple OAuth library for Java

    Java

  3. AI Public

  4. pjwan2 Public

    Config files for my GitHub profile.

    TypeScript

  5. localindex Public

  6. medical-bot Public

    Python