docs: document the Gym to RL framework token-ID data interface by ananthsub · Pull Request #1554 · NVIDIA-NeMo/Gym

ananthsub · 2026-06-10T01:34:46Z

Follow up to #1545

Add a Data Interface section to the on-policy corrections page describing what token ID fields the model server returns during training (prompt_token_ids, generation_token_ids, generation_log_probs), where they attach, and the rule that the message-level token IDs are the single source of truth: they are produced once by the model server and propagated turn-to-turn on the messages, and callers must not construct or inject prefix token IDs out of band

copy-pr-bot · 2026-06-10T01:34:50Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

Add a Data Interface section to the on-policy corrections page describing what token-ID fields the model server returns during training (prompt_token_ids, generation_token_ids, generation_log_probs), where they attach, and the rule that the message-level token IDs are the single source of truth: they are produced once by the model server and propagated turn-to-turn on the messages, and callers must not construct or inject prefix token IDs out of band. Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

copy-pr-bot · 2026-06-10T17:22:23Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

github-actions · 2026-06-16T13:35:02Z

🌿 Preview your docs: https://nvidia-preview-ananthsub-docs-gym-rl-data-contract.docs.buildwithfern.com/nemo/gym

Here are the markdown pages you've updated:

nemo/gym/contribute/rl-framework-integration/openai-compatible-http-server-on-policy-correction

ananthsub requested review from bxyu-nvidia and cmunley1 June 10, 2026 01:34

ananthsub added documentation Improvements to documentation training Training framework integrations labels Jun 10, 2026

bxyu-nvidia approved these changes Jun 10, 2026

View reviewed changes

ananthsub marked this pull request as ready for review June 10, 2026 16:15

ajaymittur mentioned this pull request Jun 10, 2026

fix(vllm_model): preserve required_prefix_token_ids #1545

Closed

ananthsub force-pushed the ananthsub/docs-gym-rl-data-contract branch from 28f87d5 to a60d730 Compare June 10, 2026 17:22

Merge branch 'main' into ananthsub/docs-gym-rl-data-contract

c86369e

ananthsub merged commit 604cc86 into NVIDIA-NeMo:main Jun 16, 2026
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: document the Gym to RL framework token-ID data interface#1554

docs: document the Gym to RL framework token-ID data interface#1554
ananthsub merged 2 commits into
NVIDIA-NeMo:mainfrom
ananthsub:ananthsub/docs-gym-rl-data-contract

ananthsub commented Jun 10, 2026

Uh oh!

copy-pr-bot Bot commented Jun 10, 2026

Uh oh!

copy-pr-bot Bot commented Jun 10, 2026

Uh oh!

github-actions Bot commented Jun 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ananthsub commented Jun 10, 2026

Uh oh!

copy-pr-bot Bot commented Jun 10, 2026

Uh oh!

copy-pr-bot Bot commented Jun 10, 2026

Uh oh!

github-actions Bot commented Jun 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants