Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

add: DFlash block diffusion speculative decoding
#1211 opened Apr 8, 2026 by ChenhanYu Loading…
Upgrade ONNX from 1.19 to 1.21
#1207 opened Apr 8, 2026 by ajrasane Loading…
[1/N] Polish PTQ skills
#1198 opened Apr 8, 2026 by Edwardf0t1 Loading…
Simplify KDTrainer and enhance ModelOptHFTrainer
#1191 opened Apr 7, 2026 by realAsma Loading…
4 of 6 tasks
Generic Fused MoE Quantization + Export for transformers 5.0+
#1187 opened Apr 7, 2026 by Edwardf0t1 Loading…
2 of 3 tasks
GPTQ test
#1179 opened Apr 6, 2026 by sugunav14 Draft
[1/N] Refactor llm_qat example: YAML configs + ModelOptArgParser
#1172 opened Apr 2, 2026 by realAsma Loading…
3 of 4 tasks
Refactor Qwen3.5 MoE quantization to use _QuantFunctionalMixin
#1170 opened Apr 2, 2026 by cjluo-nv Loading…
4 tasks
Add the Skip softmax for diffusion
#1166 opened Apr 2, 2026 by jingyu-ml Loading…
recipes doc
#1165 opened Apr 2, 2026 by shengliangxu Draft
Added support for MoE for vllm >= 0.14.0rc1
#1162 opened Apr 1, 2026 by kinjalpatel27 Loading…
fix spelling errors
#1153 opened Apr 1, 2026 by noeyy-mino Loading…
ProTip! Add no:assignee to see everything that’s not assigned.