Ambuj Hakhu ambuj991

Hello World! I'm Ambuj Hakhu

🧠 Applied AI Scientist & ML Engineer

I specialize in designing and deploying enterprise-grade Generative AI and LLM systems. Currently, I'm an Applied AI Scientist, where I bridge the gap between cutting-edge research and production-scale security applications.

🔬 Model Distillation – Successfully distilled Claude 3.5 Sonnet into a local Qwen3-0.6B model, achieving a 300× latency reduction.
🛡️ AI for Security – Invented StackPrint (patent pending), a system clustering 3.5M+ vulnerability records with 87.5% coverage.
🚀 Distributed Training – Multi-GPU optimization using DeepSpeed, FSDP, and Megatron-LM.
🛠️ Infrastructure – Scalable AI via Docker, Kubernetes, and high-throughput inference APIs.

🛠️ Tech Stack

ML / AI
LLM Ops
Cloud & Data

🚀 Featured Work

📦 RuFus — AI-Powered Web Extraction

https://github.com/ambuj991/rufus

A Python package for intelligent web extraction tailored for RAG pipelines.

Natural language aware web data extraction
Designed for LLM-powered retrieval pipelines
Compatible with LangChain & LlamaIndex
Supports headless rendering for complex websites

📦 Package available on TestPyPI:
https://test.pypi.org/project/rufus-ai-web-extraction/

🛡️ StackPrint (Patent Pending)

Enterprise deployment-template detection system.

Clustered 3.5M+ vulnerability records
Identified provisioning groups automatically
Reduced manual patch management by 96.3%

⚡ Local LLM Distillation

Distilled Claude 3.5 Sonnet → Qwen3-0.6B
Trained using Unsloth-accelerated LoRA
Achieved 93.5% task accuracy
Replaced expensive cloud API inference

📈 Expertise Map

graph LR
    A[Applied AI] --> B[Model Optimization]
    A --> C[Scalable RAG]
    B --> B1[Quantization]
    B --> B2[LoRA Fine-tuning]
    B --> B3[Distillation]
    C --> C1[VectorDBs]
    C --> C2[Knowledge Graphs]
    A --> D[MLOps]
    D --> D1[Distributed Training]
    D --> D2[Inference Serving]

    style A fill:#2196F3,stroke:#fff,stroke-width:2px,color:#fff

🎓 Education

University of Cincinnati

M.S. Computer Science (NLP & ML)

GPA: 3.8/4.0

SVKM’s NMIMS University

B.Tech Electronics & Telecommunication

Mumbai, India

📫 Let's Connect

Made with ❤️ in San Jose

Provide feedback

Saved searches

Use saved searches to filter your results more quickly