NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 375
Star 2.6k

Code
Issues 56
Pull requests 149
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security and quality
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 32 Milestones 0

New pull request New

149 Open 874 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

feat(launcher): add DFlash support for DeepSeek-V4-Flash target model

#1379 opened Apr 30, 2026 by ChenhanYu Collaborator

Loading…

Use trtexec_safe on safety platforms when using remoteAutoTuning

#1378 opened Apr 30, 2026 by dthienan-nv Contributor

Loading…

Enable active-param and memory based Minitron pruning constraint

#1377 opened Apr 30, 2026 by kevalmorabia97 Collaborator

Loading…

1 task

Add Nemotron-3-Nano-30B-A3B-BF16 e2e tutorial: Prune + Distill + Quantize + Nemo Evaluator + vLLM deployment

#1376 opened Apr 30, 2026 by kevalmorabia97 Collaborator • Draft

Fix sparsity-only export emitting empty hf_quant_config.json cherry-pick-0.44.0

After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc

#1375 opened Apr 29, 2026 by kaix-nv Contributor

Loading…

Add closed-form MXFP4 -> NVFP4 weight cast (--cast_mxfp4_to_nvfp4)

#1372 opened Apr 29, 2026 by cjluo-nv Collaborator

Loading…

5 tasks done

fix: guard against None chat_template in _post_process_chat_template cherry-pick-0.44.0

After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc

#1371 opened Apr 29, 2026 by yeyu-nvidia Contributor

Loading…

fix: include medusa in data_module assignment in main.py cherry-pick-0.44.0

After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc

#1370 opened Apr 29, 2026 by yeyu-nvidia Contributor

Loading…

Added fallback to load extra cudnn dlls in the site packages cherry-pick-0.44.0

After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc

#1369 opened Apr 29, 2026 by hthadicherla Contributor

Loading…

k25 dflash hardcode support

#1367 opened Apr 29, 2026 by h-guo18 Contributor • Draft

[Fix]: $HOME in launcher eagle example

#1365 opened Apr 28, 2026 by h-guo18 Contributor

Loading…

Experiment: MXFP4 -> NVFP4 conversion MSE study (scratch)

#1364 opened Apr 28, 2026 by cjluo-nv Collaborator • Draft

3 tasks

Fix dynamic block quantizer detection & MSE MOE calibration; Add Nemotron Super v3 NVFP4 PTQ recipe

#1363 opened Apr 28, 2026 by jenchen13 Contributor

Loading…

[SKILL.md Chore] make .agents/ the cannonical agent-skills location

#1362 opened Apr 28, 2026 by shljessie

Loading…

Enable runtime optimization

#1358 opened Apr 28, 2026 by grzegorz-k-karch Contributor • Draft

Add pre-built evaluation recipes for common benchmarks

#1357 opened Apr 27, 2026 by kaix-nv Contributor

Loading…

[6106576] Restore llm_export_utils as deprecated shim for edgellm 0.6.1 compat cherry-pick-0.44.0

After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc

#1356 opened Apr 27, 2026 by ajrasane Contributor

Loading…

2 tasks done

[6110209] Patch zero FP16 scales in INT4_AWQ ONNX export cherry-pick-0.44.0

After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc

#1353 opened Apr 27, 2026 by ajrasane Contributor

Loading…

[OMNIML-4021]: align local JSONL loading with HF datasets path + keep original behaviour

#1345 opened Apr 24, 2026 by shengliangxu Collaborator

Loading…

3 tasks done

[minor] fixes for layerwise calib + MSE

#1344 opened Apr 24, 2026 by Fridah-nv Contributor

Loading…

DSV4 dequant on the fly

#1341 opened Apr 24, 2026 by mxinO Contributor • Draft

Update

#1338 opened Apr 23, 2026 by jingyu-ml Contributor • Draft

[OMNIML-3934] Guidelines and precommit hook for pydantic backward compatbility

#1333 opened Apr 23, 2026 by jenchen13 Contributor

Loading…

[Refactor] speculative decoding: use mto config subsystem

#1328 opened Apr 23, 2026 by h-guo18 Contributor

Loading…

Quantize lm_head + embedding for Nemotron-H, add NVFP4 W4A16 recipe

#1327 opened Apr 22, 2026 by ajrasane Contributor

Loading…

3 of 5 tasks

Previous 1 2 3 4 5 6 Next

Previous Next

ProTip! no:milestone will show everything without a milestone.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!