ambuj991/README.md

Hello World! I'm Ambuj Hakhu

LinkedIn Email GitHub Location

🧠 Applied AI Scientist & ML Engineer

I design and deploy enterprise-grade Generative AI and LLM systems. I currently work as an Applied AI Scientist, bridging cutting-edge research and production-scale security applications.

  • 🔬 Model Distillation – Distilled Claude 3.5 Sonnet into a local Qwen3-0.6B model, achieving a 300× latency reduction.
  • 🛡️ AI for Security – Invented StackPrint (patent pending), a system clustering 3.5M+ vulnerability records with 87.5% coverage.
  • 🚀 Distributed Training – Multi-GPU optimization using DeepSpeed, FSDP, and Megatron-LM.
  • 🛠️ Infrastructure – Scalable AI via Docker, Kubernetes, and high-throughput inference APIs.

🛠️ Tech Stack

ML / AI
LLM Ops
Cloud & Data

🚀 Featured Work

📦 RuFus: AI-Powered Web Extraction

https://github.com/ambuj991/rufus

A Python package for intelligent web extraction tailored for RAG pipelines.

  • Natural-language-aware web data extraction
  • Designed for LLM-powered retrieval pipelines
  • Compatible with LangChain & LlamaIndex
  • Supports headless rendering for complex websites

📦 Package available on TestPyPI:
https://test.pypi.org/project/rufus-ai-web-extraction/


🛡️ StackPrint (Patent Pending)

Enterprise deployment-template detection system.

  • Clustered 3.5M+ vulnerability records
  • Identified provisioning groups automatically
  • Reduced manual patch management by 96.3%

⚑ Local LLM Distillation

  • Distilled Claude 3.5 Sonnet → Qwen3-0.6B
  • Trained using Unsloth-accelerated LoRA
  • Achieved 93.5% task accuracy
  • Replaced expensive cloud API inference

📈 Expertise Map

graph LR
    A[Applied AI] --> B[Model Optimization]
    A --> C[Scalable RAG]
    B --> B1[Quantization]
    B --> B2[LoRA Fine-tuning]
    B --> B3[Distillation]
    C --> C1[VectorDBs]
    C --> C2[Knowledge Graphs]
    A --> D[MLOps]
    D --> D1[Distributed Training]
    D --> D2[Inference Serving]

    style A fill:#2196F3,stroke:#fff,stroke-width:2px,color:#fff

🎓 Education

University of Cincinnati

M.S. Computer Science (NLP & ML)

GPA: 3.8/4.0

SVKM’s NMIMS University

B.Tech Electronics & Telecommunication

Mumbai, India


📫 Let's Connect

email linkedin github

Made with ❤️ in San Jose

Popular repositories

  1. ambuj991 (Public)

    Config files for my GitHub profile.

  2. Leetcode-Patterns (Public)

    Solutions to LeetCode problems

    Python

  3. Projects (Public)

    Jupyter Notebook

  4. Analyzing-TV-Data (Public)

    Jupyter Notebook

  5. Predicting-credit-card-approvals (Public)

    Jupyter Notebook

  6. The-andriod-app-market-on-Google-play (Public)

    Jupyter Notebook