-
Notifications
You must be signed in to change notification settings - Fork 355
Pull requests: NovaSky-AI/SkyRL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[train] Async batch collation (double-buffering) for the SFT trainer
#1809
opened Jun 18, 2026 by
dyurk-lila
Loading…
2 tasks done
[train] Vectorize controller-side training-batch collation (SFT + RL)
#1808
opened Jun 18, 2026 by
dyurk-lila
Loading…
3 tasks done
[train] Skip building unused per-token loss_fn_outputs when the caller does not consume them
#1807
opened Jun 18, 2026 by
dyurk-lila
Loading…
[megatron] Stream ChunkedDistributedLogprob.backward into a preallocated buffer (lower peak memory)
#1806
opened Jun 18, 2026 by
dyurk-lila
Loading…
[megatron] Accept dtype-string optimizer_config_kwargs (coerce exp_avg_dtype etc. to torch.dtype)
#1805
opened Jun 18, 2026 by
dyurk-lila
Loading…
[chore][logging] Add trajectory and group completion metrics for async RL
#1804
opened Jun 18, 2026 by
SumanthRH
Member
Loading…
1 task
[fully_async] add sample_full_batch: drop zero-variance groups and refill the mini-batch
#1802
opened Jun 17, 2026 by
erictang000
Collaborator
Loading…
[algorithm] make zero-variance filter generic (tolerance + loss-mask aware + metric)
#1801
opened Jun 17, 2026 by
erictang000
Collaborator
Loading…
[chore]Upgrade vllm to 0.23.0
run_h100_gpu_ci
Run H100 GPU CI
run_train_gpu_ci
run_train_megatron_gpu_ci
#1800
opened Jun 17, 2026 by
SumanthRH
Member
Loading…
[Fix] vLLM Metrics Scrapper throughput calculation
#1794
opened Jun 16, 2026 by
zanderjiang
Loading…
rename adv_estimator param to advantage_estimator in compute_advantages_and_returns
#1793
opened Jun 16, 2026 by
KTanmay1
Loading…
1 task
[train] Save HF processor on checkpoint export for VLMs
#1785
opened Jun 14, 2026 by
dinhxuanvu
Contributor
Loading…
1 of 2 tasks
[fix] Honor served_model_name and surface HTTP errors in RemoteInferenceEngine
#1783
opened Jun 13, 2026 by
discobot
Contributor
Loading…
[fix] Use masked mean in advantage batch normalization
#1782
opened Jun 12, 2026 by
discobot
Contributor
Loading…
Support top-K distillation (SDFT/OPSD): teacher top-K sampling + soft-target CE training
#1777
opened Jun 11, 2026 by
atemaguer
Contributor
Loading…
Rollout Routing Replay (R3) for the
fsdp backend, using vllm==0.22
#1772
opened Jun 10, 2026 by
jamesbraza
Contributor
Loading…
Attach the failed actor's log tail when inference engines die during init
#1771
opened Jun 10, 2026 by
jamesbraza
Contributor
Loading…
[megatron] Fused LM-head log-prob + entropy (avoid full [*, seq, vocab] logit materialization)
#1765
opened Jun 9, 2026 by
dyurk-lila
Loading…
4 tasks done
[train] VLM SFT support on Megatron backend (Qwen3-VL)
#1752
opened Jun 4, 2026 by
s-chundi
Loading…
feat(profiler): drive torch.profiler around the training loop
#1750
opened Jun 4, 2026 by
dyurk-lila
Loading…
fix: release slot on CancelledError instead of hard-exit at epoch boundaries
#1733
opened Jun 1, 2026 by
dinhxuanvu
Contributor
Loading…
add validation sample logging
#1713
opened May 28, 2026 by
erictang000
Collaborator
Loading…
2 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.