Pull requests: NVIDIA/Model-Optimizer
Pull requests list
Fix llm_ptq model loading on transformers<4.56 (dtype kwarg)
#1800
opened Jun 23, 2026 by
jingyu-ml
Contributor
[1/2] Deprecate examples/diffusers/eval image-quality evaluation example
#1798
opened Jun 23, 2026 by
jingyu-ml
Contributor
Remove deprecated examples/llm_autodeploy
#1797
opened Jun 22, 2026 by
Fridah-nv
Contributor
Deprecate examples/llm_autodeploy
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1796
opened Jun 22, 2026 by
Fridah-nv
Contributor
Skip ComfyUI safetensors post-processing unless opted in (fix sharded FLUX export)
#1794
opened Jun 22, 2026 by
jingyu-ml
Contributor
[Cherry-pick] PRs #1660 #1742 #1740 #1744 #1737 #1669 #1690 #1746 #1750 #1755 #1754 #1761 #1765
#1793
opened Jun 22, 2026 by
kevalmorabia97
Collaborator
Add Minitron pruning support for VLM language models
#1792
opened Jun 22, 2026 by
kevalmorabia97
Collaborator
Draft
Align eval skill AA benchmarks to golden NeMo configs + harden skill
#1790
opened Jun 22, 2026 by
cjluo-nv
Collaborator
Create adding_new_model_tutorial.md
#1784
opened Jun 22, 2026 by
danielkorzekwa
Contributor
Add: support input_shape_profile for trt-rtx ep
#1782
opened Jun 22, 2026 by
haoxiz-nvidia
Contributor
Fix low_memory_mode meta-device crash on fused-MoE models
#1781
opened Jun 21, 2026 by
abatilo
Experimental claude skill for puzzletron algorithm
#1769
opened Jun 18, 2026 by
danielkorzekwa
Contributor
feat(launcher): add Megatron-Bridge quantize/generate/export wrappers
#1767
opened Jun 17, 2026 by
yueshen2016
Contributor
feat(recipes): add nvfp4_mlp_only-kv_fp8-novit (exclude VL vision tower)
#1760
opened Jun 17, 2026 by
Edwardf0t1
Contributor
refactor(examples): rename llm_ptq → hf_ptq (symlink for back-compat)
#1759
opened Jun 17, 2026 by
Edwardf0t1
Contributor
Add p quantization to our triton fa kernel
#1757
opened Jun 16, 2026 by
sychen52
Contributor
[OMNIML-5003] Support non-gated fused MoE experts (NemotronH) in HF PTQ
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1756
opened Jun 16, 2026 by
jenchen13
Contributor
DFlash for MiniMax-M3 (WIP): synthesis thinking-mode mix
#1749
opened Jun 16, 2026 by
yeyu-nvidia
Contributor
Draft