Add tau2-synth-library reward-profiling config + tau2 deps by dpickem · Pull Request #1503 · NVIDIA-NeMo/Gym

dpickem · 2026-06-02T21:31:45Z

Adds a tau2-synth (library domain) config for the verifiers_agent and the tau2 / tau2-synth dependencies so the agent can run the tau2-synth env. Used for reward profiling of Nemotron Nano v3.5 on Polyphe.

copy-pr-bot · 2026-06-02T21:31:48Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

dpickem · 2026-06-02T21:34:32Z

Do not merge this. Instead of adding configs / requirements to the verifier agent, we should be using the new environment abstraction (https://github.com/NVIDIA-NeMo/Gym/tree/main/environments). This is just for testing.

cmunley1 · 2026-06-03T18:34:44Z

For dependency isolation i think we need to do this #1469 rather than use environments/ unless we refactor nemo gym core to put venvs into environments/ or something

Add tau2-synth-library reward-profiling config + tau2 deps

bdef6dc

Adds a tau2-synth (library domain) config for the verifiers_agent and the tau2 / tau2-synth dependencies so the agent can run the tau2-synth env. Used for reward profiling of Nemotron Nano v3.5 on Polyphe.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add tau2-synth-library reward-profiling config + tau2 deps#1503

Add tau2-synth-library reward-profiling config + tau2 deps#1503
dpickem wants to merge 1 commit into
mainfrom
dpickem/tau2-synth-reward-profiling

dpickem commented Jun 2, 2026

Uh oh!

copy-pr-bot Bot commented Jun 2, 2026

Uh oh!

dpickem commented Jun 2, 2026

Uh oh!

cmunley1 commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dpickem commented Jun 2, 2026

Uh oh!

copy-pr-bot Bot commented Jun 2, 2026

Uh oh!

dpickem commented Jun 2, 2026

Uh oh!

cmunley1 commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants