-
Notifications
You must be signed in to change notification settings - Fork 313
Pull requests: NovaSky-AI/SkyRL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[models] add nemotron 30b nano run scripts
#1612
opened May 2, 2026 by
erictang000
Collaborator
Loading…
[train] SFT 2/N: Support training on all assistant messages, add HF export support and
num_epochs
#1611
opened May 2, 2026 by
SumanthRH
Member
Loading…
[train] Support streaming mini-batch (non-blocking async training)
#1607
opened May 1, 2026 by
rishithayenumula
Loading…
[models] Add nemotron-nano-30b-a3b to CI
#1603
opened Apr 30, 2026 by
erictang000
Collaborator
Loading…
[train] Use custom wheel for vllm-router for
/chat/completions fix
run_train_gpu_ci
#1601
opened Apr 30, 2026 by
SumanthRH
Member
Loading…
Add FP8 text-to-SQL training script; drop SQLEnv stop-tag assertion
#1599
opened Apr 30, 2026 by
hao-aaron
Collaborator
Loading…
Fix GPU assignment for Slurm-launched Ray clusters
#1592
opened Apr 29, 2026 by
agolajko
Contributor
Loading…
Fix AssertionError during eval when val set size is not divisible by train_batch_size
#1589
opened Apr 29, 2026 by
rishithayenumula
Loading…
[feat] Multi-LoRA serving for RemoteInferenceClient
#1579
opened Apr 28, 2026 by
hao-aaron
Collaborator
Loading…
[train] Fix rollout metrics for step-wise and custom generators (sync / fully async)
#1556
opened Apr 22, 2026 by
CharlieFRuan
Member
•
Draft
1 of 3 tasks
[WIP] Add changes needed for FP8 megatron training
#1543
opened Apr 21, 2026 by
pcmoritz
Collaborator
Loading…
[train][step-wise] Three correctness/efficiency fixes for step-wise training
#1539
opened Apr 20, 2026 by
CharlieFRuan
Member
Loading…
4 tasks done
[skyrl] Preserve staged forward_backward loss_fn_outputs across DP ranks
#1534
opened Apr 19, 2026 by
taivu1998
Loading…
Modify SkyRL Generator to Append Router Indices in Multi-Turn
#1530
opened Apr 17, 2026 by
devpatelio
Collaborator
Loading…
Add FoldGRPO advantage estimator and process_rewards pipeline
#1514
opened Apr 15, 2026 by
sumi-fleet-hub
Loading…
SFT loss aggregation consistent with RL path
#1513
opened Apr 15, 2026 by
agolajko
Contributor
Loading…
fix(docker): optimize Dockerfile.megatron to reduce image size by 1.36 GB
run_train_megatron_gpu_ci
#1499
opened Apr 11, 2026 by
dinhxuanvu
Loading…
feat: add max_tokens_per_microbatch config for token-based micro-batching
#1477
opened Apr 8, 2026 by
erictang000
Collaborator
Loading…
feat: native Atropos-SHM integration and modular ingestion layer
#1473
opened Apr 7, 2026 by
RUFFY-369
Loading…
[train] Enable expandable_segments to reduce GPU memory fragmentation
run_train_gpu_ci
#1470
opened Apr 7, 2026 by
CharlieFRuan
Member
•
Draft
5 tasks done
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.