NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 337
Star 2.4k

Code
Issues 60
Pull requests 124
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security and quality
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 30 Milestones 0

New pull request New

124 Open 742 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

add: DFlash block diffusion speculative decoding

#1211 opened Apr 8, 2026 by ChenhanYu

Loading…

Replace in-repo LLM ONNX export with TensorRT-Edge-LLM

#1210 opened Apr 8, 2026 by ajrasane

Loading…

Upgrade ONNX from 1.19 to 1.21

#1207 opened Apr 8, 2026 by ajrasane

Loading…

Consolidate lm-eval scripts: merge AnyModel auto-detection into lm_eval_hf.py

#1206 opened Apr 8, 2026 by j-rausch

Loading…

Add Z-Image (NextDiT/Lumina2) PTQ quantization support in diffusers example

#1205 opened Apr 8, 2026 by andrea-pilzer

Loading…

[1/N] Polish PTQ skills

#1198 opened Apr 8, 2026 by Edwardf0t1

Loading…

Add support for postprocess exported model for block scale swizzling and support for different padding strategy

#1195 opened Apr 8, 2026 by ynankani

Loading…

fix: handle accelerate CPU-offloaded models in FakeQuant export

#1194 opened Apr 8, 2026 by sungsooha

Loading…

Validate non-empty cfg when enabling quantizers in quant_cfg

#1192 opened Apr 7, 2026 by shengliangxu

Loading…

Simplify KDTrainer and enhance ModelOptHFTrainer

#1191 opened Apr 7, 2026 by realAsma

Loading…

4 of 6 tasks

Add ModelOpt Triton attention kernels for WAN2.2 diffusion (sparse, skip-softmax, NVFP4)

#1190 opened Apr 7, 2026 by yeyu-nvidia

Loading…

5 tasks

Generic Fused MoE Quantization + Export for transformers 5.0+

#1187 opened Apr 7, 2026 by Edwardf0t1

Loading…

2 of 3 tasks

[chore]: weekly bump of uv.lock on main (2026-04-06)

#1180 opened Apr 6, 2026 by github-actions bot

Loading…

GPTQ test

#1179 opened Apr 6, 2026 by sugunav14 • Draft

feat: parallelize fakequant export across GPUs via ThreadPoolExecutor

#1177 opened Apr 3, 2026 by sungsooha

Loading…

[1/N] Refactor llm_qat example: YAML configs + ModelOptArgParser

#1172 opened Apr 2, 2026 by realAsma

Loading…

3 of 4 tasks

[minor] add a general FP8ScaleSweepCalibrator and its registry

#1171 opened Apr 2, 2026 by Fridah-nv

Loading…

Refactor Qwen3.5 MoE quantization to use _QuantFunctionalMixin

#1170 opened Apr 2, 2026 by cjluo-nv

Loading…

4 tasks

Add the Skip softmax for diffusion

#1166 opened Apr 2, 2026 by jingyu-ml

Loading…

recipes doc

#1165 opened Apr 2, 2026 by shengliangxu • Draft

[NVBug 6045859]Fix export support for Qwen3VL MoE experts

#1164 opened Apr 1, 2026 by shengliangxu

Loading…

Added support for MoE for vllm >= 0.14.0rc1

#1162 opened Apr 1, 2026 by kinjalpatel27

Loading…

Fix[bug] ONNX models generated by llm_export.py are missing some i/o

#1157 opened Apr 1, 2026 by Ratheesh1104

Loading…

fix spelling errors

#1153 opened Apr 1, 2026 by noeyy-mino

Loading…

Intermediate checkpointing for sequential calibration

#1152 opened Mar 31, 2026 by sugunav14

Loading…

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!