Skip to content

Pull requests: NVIDIA-NeMo/Automodel

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

ci: override ep size for benchmark gptoss 120b
#2352 opened May 29, 2026 by thomasdhc Contributor Loading…
3 tasks
feat(diffusion): add Qwen Image Edit training support
#2351 opened May 29, 2026 by pthombre Contributor Loading…
3 tasks
feat(model_init): add FP8 pre-flight check for the force_hf path community-request
#2350 opened May 29, 2026 by stanley1208 Contributor Loading…
2 of 3 tasks
feat(datasets): drop history reasoning_content from agent SFT prompt community-request
#2349 opened May 29, 2026 by khazic Contributor Loading…
3 tasks done
feat(eval): add tool-call accuracy evaluator for agent SFT validation community-request
#2338 opened May 28, 2026 by khazic Contributor Loading…
2 of 3 tasks
feat: falcon_h1 support with unit tests community-request
#2335 opened May 28, 2026 by edjson Contributor Draft
2 of 3 tasks
feat: adding PP and CP for nemotron v3 models
#2316 opened May 25, 2026 by adil-a Collaborator Draft
feat(dllm): add DFlash and LLaDA2 SFT recipes community-request waiting-on-customer Waiting on the original author to respond
#2315 opened May 25, 2026 by kashif Loading…
3 tasks done
docs(retrieval): add fine-tuning guide
#2306 opened May 22, 2026 by oliverholworthy Contributor Draft
3 tasks done
docs(skills): add retrieval models skill
#2305 opened May 22, 2026 by oliverholworthy Contributor Draft
3 tasks done
docs(fern): nest fern infra under docs/, hoist nightly MDX to docs/ top-level
#2291 opened May 21, 2026 by lbliii Contributor Loading…
2 of 3 tasks
fix(checkpoint): harden consolidated safetensors export
#2289 opened May 21, 2026 by yuhezhang-ai Contributor Loading…
3 tasks done
ci: Update transformers to latest version 5.9.0
#2287 opened May 21, 2026 by svcnvidia-nemo-ci Contributor Loading…
feat(diffusion): add Wan2.2 T2V-A14B two-stage finetuning support
#2284 opened May 21, 2026 by linnanwang Contributor Loading…
1 of 3 tasks
feat: Add late interaction model training support for retrieval enhancement New feature or request
#2283 opened May 20, 2026 by rnyak Collaborator Draft
2 of 3 tasks
refactor: registry package split
#2278 opened May 19, 2026 by akoumpa Contributor Draft
3 tasks
feat(bagel): add multimodal Bagel training support
#2275 opened May 18, 2026 by zyzhou5 Contributor Loading…
3 tasks
feat(engine): introduce Engine class; consolidate LLM/VLM recipes
#2269 opened May 18, 2026 by HuiyingLi Contributor Draft
4 of 5 tasks
feat: make mesh accept meshcontext
#2266 opened May 18, 2026 by adil-a Collaborator Loading…
2 of 3 tasks
perf(diffusion): improve Flux training throughput
#2251 opened May 15, 2026 by pthombre Contributor Loading…
3 tasks done
fix(checkpoint): exclude TE _extra_state keys from load-time mismatch warning
#2247 opened May 15, 2026 by adil-a Collaborator Loading…
2 tasks done
feat: Add gemma4 drafter model support
#2240 opened May 15, 2026 by athitten Contributor Loading…
3 tasks
ProTip! Follow long discussions with comments:>50.