Skip to content
View duc-dn's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing
  • VNPT AI
  • Hร  Nแป™i
  • 00:28 (UTC -12:00)

Block or report duc-dn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
duc-dn/README.md

๐Ÿ‘‹ Hi, I'm Duc

๐Ÿš€ Data Engineer @ VNPT AI
๐Ÿ”น Building scalable Lakehouse & Data Platforms
๐Ÿ”น Experienced with Big Data, Streaming, and Cloud Infrastructure
๐Ÿ”น Passionate about Data Infrastructure, APIs, and Workflow Orchestration


๐Ÿ”ง Tech Stack

๐Ÿ’ป Programming Languages

  • Python (ETL, APIs, data pipelines, orchestration)
  • Java (Big Data, Kafka, Flink, Spark ecosystem)

๐Ÿ“Š Data & Lakehouse

  • Apache Iceberg, Delta Lake
  • Apache Spark, Apache Flink
  • Kafka, Kafka Connect, Debezium (CDC from Postgres/MySQL/MongoDB)

โ˜๏ธ Cloud & Storage

  • Google BigQuery, Cloud Scheduler
  • AWS S3, MinIO
  • GCS (Google Cloud Storage)

๐Ÿ—„๏ธ Databases & Vector Search

  • PostgreSQL, MySQL, MongoDB
  • Qdrant (Vector Database)

๐Ÿ“ˆ BI & Visualization

  • Apache Superset

๐Ÿ•’ Workflow Orchestration & Scheduling

  • Apache Airflow, Cronjob, Cloud Scheduler

โš™๏ธ DevOps & Infra

  • Docker, Docker Compose
  • Kubernetes, Helm
  • Terraform, GitHub Actions

๐ŸŒ API & Software

  • FastAPI
  • Git, GitHub (version control & collaboration)

๐Ÿ“Œ Featured Projects


๐ŸŒฑ What Iโ€™m Learning

  • Data mesh & federated query engines (Trino/Presto, Dremio)
  • Advanced Iceberg optimizations (partitioning, compaction, metadata scaling)
  • Hybrid pipelines (batch + streaming with Flink + Spark)
  • AI/LLM integration with vector databases (Qdrant)

๐Ÿ“ซ Connect with Me


โญ๏ธ From ducdn

๐Ÿ“ŠGitHub Stats :




Popular repositories Loading

  1. BTL_Java_QLCB BTL_Java_QLCB Public

    Java 2

  2. hudi-cli-with-minio hudi-cli-with-minio Public

    setup Hudi CLI to connect to minio in local

    Shell 2

  3. realtime-analytic realtime-analytic Public

    ฤแป“ รกn xรขy dแปฑng mรด hรฌnh phรขn tรญch vร  xแปญ lรฝ dแปฏ liแป‡u realtime - copyright ducdn

    TypeScript 1

  4. kafka-cluster kafka-cluster Public

    create kafka-cluster with docker-compose

    Python 1

  5. SQL-Server SQL-Server Public

    TSQL

  6. Python_Basics_Tutorial Python_Basics_Tutorial Public

    Forked from CodexploreRepo/python-youtube-tutorials

    Python