Skip to content
View yaogdu's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
  • a
  • 中国

Block or report yaogdu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yaogdu/README.md

👋 Hi, I'm Toby Du

Better late than never.

GitHub followers Location Focus


🚀 About Me

I'm a backend architect and engineering leader with 10+ years of experience building distributed systems, observability platforms, and enterprise-grade AI infrastructure.

  • 🔭 Currently focused on AI-powered tools, AgentOps, RAG workflows, and the MCP ecosystem
  • 🏗️ Experienced in designing and scaling distributed systems supporting 10M+ DAU
  • ⚡ Passionate about high concurrency, high availability, reliability engineering, and developer efficiency
  • 👥 7+ years of experience in team leadership, technical governance, and cross-functional delivery
  • 💡 Interested in knowledge management, productivity tools, fintech, and developer platforms
  • 📍 Based in Beijing, China

🧠 What I Do

  • Build high-performance backend systems with strong stability and observability
  • Design cloud-native architectures for large-scale business platforms
  • Develop enterprise AI capabilities with LangGraph, AutoGen, MCP, RAG, and AgentOps
  • Bridge technical architecture and product goals to deliver systems that create real business value

🛠️ Tech Stack

Languages

Java Go Python

Backend & Infra

Spring Boot MySQL Redis Message Queue Docker Kubernetes

Cloud & Observability

AWS Alibaba Cloud Prometheus Grafana N9E

AI Engineering

LangGraph AutoGen MCP RAG AgentOps

🏆 Selected Highlights

  • Scalable Architecture Expert — 10+ years in backend engineering, with hands-on experience designing distributed systems for 10M+ DAU
  • Reliability Driven — built and scaled platforms with 99.9%+ availability, strong observability, and minute-level alerting
  • Polyglot Engineer — advanced proficiency in Java (Spring Boot), Go, and Python
  • Technical Leadership — 7+ years leading teams, improving R&D workflows, and building engineering governance
  • AI Infrastructure Builder — experienced in enterprise-grade Agents, RAG workflows, MCP integrations, and AgentOps platforms
  • Product-Minded Architect — able to turn complex technical requirements into practical systems that drive business outcomes

💼 Experience Snapshot

  • Orange Digital Technology — Led architecture design and built the company-wide observability stack with Prometheus, SLS, Grafana, and N9E, achieving 99.9% uptime and minute-level monitoring
  • Huanxin Network — Built the business middle platform from scratch, supporting 6 business lines, millions of DAU, and 99.99% system stability
  • JD Technology — Supported large-scale marketing and coupon systems during major promotional campaigns with millions of QPS
  • VIPKID — Led booking and class-management architecture, supporting 100,000 peak QPS and 10+ collaborating teams
  • Duolabao — Designed core payment systems, OAuth2 open platform, data synchronization middleware, and big data capabilities

📦 Featured Projects

SkillHub: Enterprise AI Asset Distribution & AgentOps Infrastructure

A unified distribution system for Prompts, Agent scripts, and MCP Servers, built around the npx interaction paradigm with enterprise-grade enhancements.

  • Standardized package format for Prompts / Agents / MCPs
  • Automated runtime isolation with Python venv and Node modules
  • Shadow path mapping via symlink orchestration
  • Deep integration with Anthropic MCP
  • Offline-ready versioned snapshotting and metadata indexing
  • Static auditing pipeline for high-risk command filtering

Enterprise Stability & Observability Revitalization

A company-wide observability initiative designed to shift engineering from reactive firefighting to proactive system management.

  • Built a unified observability platform across Java, Go, and Python
  • Developed an AI-driven Alert Gateway with alert convergence and automated ACK workflows
  • Implemented graceful startup/shutdown protocols to reduce deployment failures
  • Drove adoption of standardized monitoring and on-call governance
  • Reached 85% monitoring coverage and minute-level anomaly detection

📈 GitHub Activity

💭 Random Dev Quote

📫 Let's Connect

Popular repositories Loading

  1. datax datax Public

    es plugin of datax for copying mysql data to elasticsearch

    Java 10 3

  2. insurance insurance Public

    simple 「mind map」of insurance. 简单的保险思维导图

    2

  3. rpc rpc Public

    rpc framework

    Java 1

  4. elasticsearch-mysql elasticsearch-mysql Public

    Forked from GongDexing/elasticsearch-mysql

    简单实用的同步工具,实现mysql数据库中数据定期同步到elasticsearch,只需简单的配置,便能达到非凡的效果,支持elasticsearch 5.X版本

    Java 1 1

  5. goBook goBook Public

    Forked from xionglun/goBook

    Go语言学习笔记

    Go 1

  6. golang-examples golang-examples Public

    Forked from binque/golang-examples

    Golang 辅助手册是我在自己的学习过程中查阅资料的总结以及一些自己的想法的文章合集,它们或是自己的总结,或是别人优秀的入门教程的翻译,希望对你的Golang入门有所帮助,另外,有任何错误或者兴合理的内容还希望你能在第一时间告诉我。

    Go 1