Pull requests: NVIDIA-NeMo/Megatron-Bridge
Pull requests list
fix(deepseek): gate H100 fused kernel defaults
area:perf
Performance optimizations and benchmarking
bug
Something isn't working
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
#4338
opened Jun 12, 2026 by
cuichenx
Contributor
cp: [recipe,model,ckpt] DeepSeek-V4 backports (4131, 4271, 4305, 4306, 4338) into r0.5.0
area:recipe
Training recipes and launch configs
cherry-pick
feature
New capabilities, enhancements, or enablement work
needs-review
PR is ready for code review and waiting on a reviewer
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#4337
opened Jun 12, 2026 by
cuichenx
Contributor
cp: docs(deepseek): update H100 Flash parallelism (4334) into r0.5.0
area:perf
Performance optimizations and benchmarking
cherry-pick
docs
Documentation-only updates or documentation debt
docs-only
With great power comes great responsibility.
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
Run CICD
#4336
opened Jun 12, 2026 by
svcnvidia-nemo-ci
Contributor
cp: [examples] fix: Make Qwen2-Audio SFT runnable by default (4313) into r0.5.0
area:recipe
Training recipes and launch configs
bug
Something isn't working
cherry-pick
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
Run CICD
#4335
opened Jun 12, 2026 by
svcnvidia-nemo-ci
Contributor
Add functional support matrix to README
area:misc
Cross-cutting utilities, logging, helpers, and other changes
docs
Documentation-only updates or documentation debt
docs-only
With great power comes great responsibility.
needs-review
PR is ready for code review and waiting on a reviewer
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#4333
opened Jun 12, 2026 by
snowmanwwg
Contributor
5 tasks
[model] fix: adapt MCore dev attention gates
area:model
Model implementations and HF bridge logic
bug
Something isn't working
full-test-suite
needs-review
PR is ready for code review and waiting on a reviewer
#4326
opened Jun 12, 2026 by
yaoyu-33
Contributor
docs: Add initial fern migration
area:misc
Cross-cutting utilities, logging, helpers, and other changes
docs
Documentation-only updates or documentation debt
needs-review
PR is ready for code review and waiting on a reviewer
#4325
opened Jun 12, 2026 by
chtruong814
Contributor
5 tasks
chore(beep boop 🤖): Bump uv.lock (main, mcore-dev) (2026-06-12)
full-test-suite
#4324
opened Jun 12, 2026 by
svcnvidia-nemo-ci
Contributor
fix(training): run K8s trainer from data-mover-synced code
area:training
Training loop, callbacks, and runtime integration
bug
Something isn't working
26.06 perf summary
26.06
docs
Documentation-only updates or documentation debt
docs-only
With great power comes great responsibility.
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#4318
opened Jun 12, 2026 by
malay-nagda
Contributor
Draft
5 tasks
Fix nemotronLabsDiffusion checkpoint conversion
area:diffusion
DFM module
bug
Something isn't working
needs-more-tests
Requires additional L0 and L1 test coverage before merge
r0.5.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
#4317
opened Jun 12, 2026 by
sajadn
Contributor
5 tasks
fix+feat(gemma2): fix SWA correctness bugs and add FlexAttention fused softcap+SWA path
area:model
Model implementations and HF bridge logic
bug
Something isn't working
community-request
needs-review
PR is ready for code review and waiting on a reviewer
#4308
opened Jun 11, 2026 by
nvegesna-netizen
Contributor
feat(data): add text chat collate for unified HF datasets
area:data
Dataset builders, preprocessing, and samplers
feature
New capabilities, enhancements, or enablement work
needs-more-tests
Requires additional L0 and L1 test coverage before merge
needs-review
PR is ready for code review and waiting on a reviewer
#4307
opened Jun 11, 2026 by
yaoyu-33
Contributor
[model, feature] qwen3-omni: add packed sequence support and shared sequence utilities
area:model
Model implementations and HF bridge logic
community-request
feature
New capabilities, enhancements, or enablement work
needs-review
PR is ready for code review and waiting on a reviewer
#4304
opened Jun 11, 2026 by
hbhflw2000
Contributor
chore(beep boop 🤖): Bump uv.lock (main, mcore-dev) (2026-06-11)
full-test-suite
#4303
opened Jun 11, 2026 by
svcnvidia-nemo-ci
Contributor
feat(model): thread weight_dtype through HF export for plain-dtype DeepSeek-V4 output
area:model
Model implementations and HF bridge logic
feature
New capabilities, enhancements, or enablement work
needs-review
PR is ready for code review and waiting on a reviewer
#4301
opened Jun 11, 2026 by
Meirtz
Member
mirror: [model] fix: apply attention_value_scale to V in MiMo-V2-Flash
area:model
Model implementations and HF bridge logic
bug
Something isn't working
community-request
needs-review
PR is ready for code review and waiting on a reviewer
#4299
opened Jun 11, 2026 by
ko3n1g
Contributor
mirror: feat: Add EXAONE 4.0 model bridge (LG AI Research)
area:model
Model implementations and HF bridge logic
community-request
feature
New capabilities, enhancements, or enablement work
needs-review
PR is ready for code review and waiting on a reviewer
#4298
opened Jun 11, 2026 by
ko3n1g
Contributor
mirror: feat(model): add Gemma-4 E4B support (layer spec, checkpoint loader, parity check)
area:model
Model implementations and HF bridge logic
feature
New capabilities, enhancements, or enablement work
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
#4297
opened Jun 11, 2026 by
ko3n1g
Contributor
mirror: fix(gemma4): map dense MLP pre-FFN norm to pre_feedforward_layernorm
area:model
Model implementations and HF bridge logic
bug
Something isn't working
community-request
needs-review
PR is ready for code review and waiting on a reviewer
#4296
opened Jun 11, 2026 by
ko3n1g
Contributor
refactor(docker): Dockerfile configurations and enhance environment variable management
area:build
Dependencies, packaging, images, and environment setup
bug
Something isn't working
needs-review
PR is ready for code review and waiting on a reviewer
#4288
opened Jun 10, 2026 by
balasaajay
Contributor
5 tasks
feat(diffusion): add LongLive WAN training path
area:diffusion
DFM module
community-request
feature
New capabilities, enhancements, or enablement work
needs-review
PR is ready for code review and waiting on a reviewer
#4272
opened Jun 10, 2026 by
AndysonYs
3 of 5 tasks