Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

MFU tracking for inference
#3856 opened Mar 13, 2026 by tdene Draft
5 tasks
enable async save for functional tests complexity: low Expert Review [deprecated] Apply this label to indicate that your PR is ready for expert review. Run functional tests
#3855 opened Mar 13, 2026 by dimapihtar Loading…
5 tasks
Core 0.16
remove legacy mpu complexity: low Expert Review [deprecated] Apply this label to indicate that your PR is ready for expert review.
#3854 opened Mar 13, 2026 by dimapihtar Loading…
5 tasks
Core 0.16
remove legacy data complexity: high Expert Review [deprecated] Apply this label to indicate that your PR is ready for expert review.
#3853 opened Mar 13, 2026 by dimapihtar Loading…
5 tasks
Core 0.16
[DEV] fix(megatron-fsdp): build expt_device_mesh only for MoE models dev branch Dev branch related issues and development Expert Review [deprecated] Apply this label to indicate that your PR is ready for expert review. module: megatron-fsdp
#3832 opened Mar 12, 2026 by xuwchen Loading…
5 tasks
Core 0.16
fix(megatron-fsdp): build expt_device_mesh only for MoE models Expert Review [deprecated] Apply this label to indicate that your PR is ready for expert review. module: megatron-fsdp
#3831 opened Mar 12, 2026 by xuwchen Draft
5 tasks
Core 0.16
[dev] mHC kernel fusion complexity: high
#3828 opened Mar 12, 2026 by jingqiny-99 Loading…
2 of 5 tasks
Core 0.16
chore: Move to Py3.12
#3826 opened Mar 12, 2026 by ko3n1g Draft
5 tasks
Core 0.16
feat(fsdp): use TE general_gemm for mixed-precision wgrad in FSDP path complexity: low Final Review PR is in the "final review" stage
#3822 opened Mar 12, 2026 by Victarry Loading…
2 tasks done
Core 0.16
Add Lion optimizer support complexity: low Final Review PR is in the "final review" stage
#3813 opened Mar 11, 2026 by mchrzanowski Loading…
ProTip! Filter pull requests by the default branch with base:main.