NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 453
Star 3k

Code
Issues 70
Pull requests 206
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security and quality
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 33 Milestones 0

New pull request New

206 Open 1,203 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix llm_ptq model loading on transformers<4.56 (dtype kwarg)

#1800 opened Jun 23, 2026 by jingyu-ml Contributor

Loading…

Fix ModelOpt MCP Slurm launcher submit

#1799 opened Jun 23, 2026 by ChenhanYu Collaborator

Loading…

[1/2] Deprecate examples/diffusers/eval image-quality evaluation example

#1798 opened Jun 23, 2026 by jingyu-ml Contributor

Loading…

Remove deprecated examples/llm_autodeploy

#1797 opened Jun 22, 2026 by Fridah-nv Contributor

Loading…

Deprecate examples/llm_autodeploy cherry-pick-0.45.0

After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc

#1796 opened Jun 22, 2026 by Fridah-nv Contributor

Loading…

Support INT block scale learning

#1795 opened Jun 22, 2026 by realAsma Contributor • Draft

Skip ComfyUI safetensors post-processing unless opted in (fix sharded FLUX export)

#1794 opened Jun 22, 2026 by jingyu-ml Contributor

Loading…

[Cherry-pick] PRs #1660 #1742 #1740 #1744 #1737 #1669 #1690 #1746 #1750 #1755 #1754 #1761 #1765

#1793 opened Jun 22, 2026 by kevalmorabia97 Collaborator

Loading…

Add Minitron pruning support for VLM language models

#1792 opened Jun 22, 2026 by kevalmorabia97 Collaborator • Draft

Align eval skill AA benchmarks to golden NeMo configs + harden skill

#1790 opened Jun 22, 2026 by cjluo-nv Collaborator

Loading…

[OMNIML-5060] cell_t0_d7

#1789 opened Jun 22, 2026 by ChenhanYu Collaborator • Draft

[OMNIML-5084] cell_t0_d7

#1788 opened Jun 22, 2026 by ChenhanYu Collaborator • Draft

Create adding_new_model_tutorial.md

#1784 opened Jun 22, 2026 by danielkorzekwa Contributor

Loading…

Add: suppot trt-rtx-abi ep

#1783 opened Jun 22, 2026 by haoxiz-nvidia Contributor

Loading…

Add: support input_shape_profile for trt-rtx ep

#1782 opened Jun 22, 2026 by haoxiz-nvidia Contributor

Loading…

Fix low_memory_mode meta-device crash on fused-MoE models

#1781 opened Jun 21, 2026 by abatilo

Loading…

[Chore]: Update Dflash recipes to use dpace

#1775 opened Jun 20, 2026 by h-guo18 Contributor • Draft

[OMNIML-5091] specdec_bench cell t0_d7 — nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 / MTP / trtllm

#1774 opened Jun 19, 2026 by ChenhanYu Collaborator • Draft

Experimental claude skill for puzzletron algoritgm

#1769 opened Jun 18, 2026 by danielkorzekwa Contributor

Loading…

feat(launcher): add Megatron-Bridge quantize/generate/export wrappers

#1767 opened Jun 17, 2026 by yueshen2016 Contributor

Loading…

feat(recipes): add nvfp4_mlp_only-kv_fp8-novit (exclude VL vision tower)

#1760 opened Jun 17, 2026 by Edwardf0t1 Contributor

Loading…

refactor(examples): rename llm_ptq → hf_ptq (symlink for back-compat)

#1759 opened Jun 17, 2026 by Edwardf0t1 Contributor

Loading…

Add p quantization to our triton fa kernel

#1757 opened Jun 16, 2026 by sychen52 Contributor

Loading…

[OMNIML-5003] Support non-gated fused MoE experts (NemotronH) in HF PTQ cherry-pick-0.45.0

After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc

#1756 opened Jun 16, 2026 by jenchen13 Contributor

Loading…

DFlash for MiniMax-M3 (WIP): synthesis thinking-mode mix

#1749 opened Jun 16, 2026 by yeyu-nvidia Contributor • Draft

Previous 1 2 3 4 5 … 8 9 Next

Previous Next

ProTip! Adding no:label will show everything without a label.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!