Skip to content
View ds-wook's full-sized avatar
🎯
Focusing
🎯
Focusing
  • KT AX Data Lab
  • Seoul, South Korea

Block or report ds-wook

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ds-wook/README.md

Data Scientist · Recommender System · Machine Learning Engineer


🧠 About Me

Data Scientist focused on Recommendation Systems & Ranking

  • 📊 Experienced in ML pipelines, feature engineering, and large-scale data
    processing
  • 🏆 Strong track record in Kaggle & Domestic AI Competitions
  • 🚀 Interested in Real-time personalization & ML system design

🎯 Interests

Machine Learning   Data Mining   Recommendation System


🧑‍💻 Open Source Contributions

🔹 recommenders

  • Integrated LightGBM Ranker into the recommendation ranking pipeline for Learning-to-Rank tasks
  • Developed comprehensive unit tests for the ranking and recommendation pipeline
  • Migrated deep learning models from TensorFlow to PyTorch for improved maintainability and development efficiency

🔹 yamyam-lab

GitHub

  • Developed a review-based restaurant recommendation system using a two-stage recommendation architecture
  • Designed and implemented a ranking model for personalized restaurant recommendations

🔹 pyuba

GitHub

  • Built a Python library for user behavior analysis and event-driven data processing
  • Designed reusable analytics utilities for behavioral data exploration and feature engineering

🧪 Research

🔹 CATabNet

GitHub

  • Proposed an enhanced TabNet model for categorical feature learning by incorporating ordered target statistics inspired by CatBoost
  • Improved categorical feature representation in deep learning models without relying solely on embedding-based encoding approaches
  • Applied CatBoost-style ordered target statistics (Ordered TS) to stabilize training and reduce target leakage in tabular deep learning tasks
  • Conducted research on hybrid boosting–deep learning architectures for high-performance tabular data modeling

🏅 Competition Records

Kaggle

Competition Medal Rank
Riiid Answer Correctness Prediction 🥈 78 / 3395
Ventilator Pressure Prediction 🥉 171 / 2605
Indoor Location Navigation 🥉 72 / 1170
Writing Quality Prediction 🥉 187 / 1876

Domestic

Competition Medal Rank
Dacon Web CTR Prediction 🏆 1 / 772
Credit Card Delinquency Prediction 🏆 2 / 3106
U+ AI Ground 🏆 3 / 658
Toss NEXT ML Challenge 🏅 5 / 709

Pinned Loading

  1. recommenders-team/recommenders recommenders-team/recommenders Public

    Best Practices on Recommendation Systems

    Python 21.7k 3.3k

  2. toss-next-challenge-solution toss-next-challenge-solution Public

    🏅토스 NEXT ML CHALLENGE : 광고 클릭 예측(CTR) 대회 5등 모델 제출용 레포지토리🏅

    Python 25 2

  3. Riiid Riiid Public

    🥈78th place in Riiid solution🥈

    Python 16

  4. ventilator-pressure-prediction ventilator-pressure-prediction Public

    🥉171st place in Google brain solution🥉

    Python 10

  5. writing-quality writing-quality Public

    🥉187th in Linking Writing Processes to Writing Quality solution🥉

    Python 10

  6. lunch-corp/yamyam-lab lunch-corp/yamyam-lab Public

    yamyam ML experiment lab

    Python 6 3