Pull requests: NVIDIA/Model-Optimizer
Pull requests list
feat(launcher): add DFlash support for DeepSeek-V4-Flash target model
#1379
opened Apr 30, 2026 by
ChenhanYu
Collaborator
Use trtexec_safe on safety platforms when using remoteAutoTuning
#1378
opened Apr 30, 2026 by
dthienan-nv
Contributor
Enable active-param and memory based Minitron pruning constraint
#1377
opened Apr 30, 2026 by
kevalmorabia97
Collaborator
1 task
Add Nemotron-3-Nano-30B-A3B-BF16 e2e tutorial: Prune + Distill + Quantize + Nemo Evaluator + vLLM deployment
#1376
opened Apr 30, 2026 by
kevalmorabia97
Collaborator
•
Draft
Fix sparsity-only export emitting empty hf_quant_config.json
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1375
opened Apr 29, 2026 by
kaix-nv
Contributor
Add closed-form MXFP4 -> NVFP4 weight cast (--cast_mxfp4_to_nvfp4)
#1372
opened Apr 29, 2026 by
cjluo-nv
Collaborator
5 tasks done
fix: guard against None chat_template in _post_process_chat_template
cherry-pick-0.44.0
#1371
opened Apr 29, 2026 by
yeyu-nvidia
Contributor
fix: include medusa in data_module assignment in main.py
cherry-pick-0.44.0
#1370
opened Apr 29, 2026 by
yeyu-nvidia
Contributor
Added fallback to load extra cudnn dlls in the site packages
cherry-pick-0.44.0
#1369
opened Apr 29, 2026 by
hthadicherla
Contributor
Fix dynamic block quantizer detection & MSE MOE calibration; Add Nemotron Super v3 NVFP4 PTQ recipe
#1363
opened Apr 28, 2026 by
jenchen13
Contributor
[SKILL.md Chore] make .agents/ the canonical agent-skills location
#1362
opened Apr 28, 2026 by
shljessie
Add pre-built evaluation recipes for common benchmarks
#1357
opened Apr 27, 2026 by
kaix-nv
Contributor
[6106576] Restore llm_export_utils as deprecated shim for edgellm 0.6.1 compat
cherry-pick-0.44.0
#1356
opened Apr 27, 2026 by
ajrasane
Contributor
2 tasks done
[6110209] Patch zero FP16 scales in INT4_AWQ ONNX export
cherry-pick-0.44.0
#1353
opened Apr 27, 2026 by
ajrasane
Contributor
[OMNIML-4021]: align local JSONL loading with HF datasets path + keep original behaviour
#1345
opened Apr 24, 2026 by
shengliangxu
Collaborator
3 tasks done
[OMNIML-3934] Guidelines and precommit hook for pydantic backward compatibility
#1333
opened Apr 23, 2026 by
jenchen13
Contributor
[Refactor] speculative decoding: use mto config subsystem
#1328
opened Apr 23, 2026 by
h-guo18
Contributor
Quantize lm_head + embedding for Nemotron-H, add NVFP4 W4A16 recipe
#1327
opened Apr 22, 2026 by
ajrasane
Contributor
3 of 5 tasks