Available Environments

NeMo Gym includes a curated collection of environments for training and evaluation across multiple domains. This page is generated from docs/data/environments.yaml. To update it, run:

python scripts/generate_environments_yaml.py
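Each entry on this page carries a few fields (config, readme, domain, and sometimes description, value, dataset, and a license). As a minimal sketch of how such entries could be grouped by domain, the snippet below uses a hand-written list of records copied from this page. The record layout and the `environment` and `license` key names are assumptions for illustration; the actual schema of docs/data/environments.yaml is not shown here and may differ.

```python
from collections import defaultdict

# Hypothetical in-memory records mirroring fields displayed on this page.
# The real layout of docs/data/environments.yaml may differ.
ENTRIES = [
    {"environment": "Arc Agi", "config": "arc_agi.yaml", "domain": "knowledge"},
    {"environment": "Aviary", "config": "aviary.yaml", "domain": "math",
     "license": "Apache 2.0"},
    {"environment": "Aviary", "config": "bixbench_aviary.yaml", "domain": "coding"},
    {"environment": "Calendar", "config": "calendar.yaml", "domain": "agent",
     "license": "Apache 2.0",
     "dataset": "Nemotron-RL-agent-calendar_scheduling"},
]


def configs_by_domain(entries):
    """Group config file names by their domain tag, preserving input order."""
    grouped = defaultdict(list)
    for entry in entries:
        grouped[entry["domain"]].append(entry["config"])
    return dict(grouped)


by_domain = configs_by_domain(ENTRIES)
print(by_domain["math"])
```

Grouping on the per-config domain rather than on the environment name matters for environments such as Aviary, whose configs span several domains.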

Example Environment Patterns

Multi Step example

Multi-step tool calling

example_multi_step.yaml · README

Session State Mgmt example

Session state management (in-memory)

example_session_state_mgmt.yaml · README

Single Tool Call example

Basic single-step tool calling

example_single_tool_call.yaml · README
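The three patterns above differ mainly in how many tool calls a task needs and whether anything persists between them. The toy sketch below illustrates that difference in plain Python. It does not use the NeMo Gym API: every name in it (`TOOLS`, `call_tool`, `remember`, `recall`) is hypothetical, and the example configs listed above remain the reference for the actual setup.

```python
# Toy illustration only: none of these names come from NeMo Gym.

def add(a, b):
    return a + b


def multiply(a, b):
    return a * b


# Hypothetical tool registry: tool name -> callable.
TOOLS = {"add": add, "multiply": multiply}


def call_tool(name, **kwargs):
    """Dispatch a single tool call by name."""
    return TOOLS[name](**kwargs)


# Single tool call: one request, one tool result.
single = call_tool("add", a=2, b=3)

# Multi-step tool calling: the result of one call feeds the next.
step_one = call_tool("add", a=2, b=3)
multi = call_tool("multiply", a=step_one, b=4)

# Session state (in-memory): a dict keyed by session ID keeps data across calls.
SESSIONS = {}


def remember(session_id, key, value):
    SESSIONS.setdefault(session_id, {})[key] = value


def recall(session_id, key):
    return SESSIONS.get(session_id, {}).get(key)


remember("session-1", "last_result", multi)
print(single, multi, recall("session-1", "last_result"))
```

Because the state lives in a process-local dictionary, it is lost when the process exits, which is the usual trade-off of an in-memory approach.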

Environments for Training & Evaluation

Arc Agi · 1 config

knowledge

arc_agi.yaml · knowledge · validation

config arc_agi.yaml

readme README

domain knowledge

Aviary · 4 configs

agent coding math

aviary.yaml · math · train · validation · Apache 2.0

config aviary.yaml

readme README

domain math

bixbench_aviary.yaml · coding

config bixbench_aviary.yaml

readme README

domain coding

gsm8k_aviary.yaml · math · train · validation · Apache 2.0

config gsm8k_aviary.yaml

readme README

domain math

hotpotqa_aviary.yaml · agent · train · validation · Apache 2.0

config hotpotqa_aviary.yaml

readme README

domain agent

Calendar · 1 config

agent

calendar.yaml · agent · train · validation · Apache 2.0

config calendar.yaml

readme README

domain agent

dataset Nemotron-RL-agent-calendar_scheduling

Circle Click · 1 config

other

circle_click.yaml · other

config circle_click.yaml

readme README

domain other

description Click on circles in images

Code Gen · 1 config

coding

code_gen.yaml · coding · train · validation · Apache 2.0

config code_gen.yaml

readme README

domain coding

dataset nemotron-RL-coding-competitive_coding

Equivalence Llm Judge · 4 configs

agent knowledge

equivalence_llm_judge.yaml · knowledge

config equivalence_llm_judge.yaml

readme README

domain knowledge

description Short answer questions with LLM-as-a-judge

value Improve knowledge-related benchmarks such as GPQA and HLE

lc.yaml · knowledge

config lc.yaml

readme README

domain knowledge

lc_judge.yaml · knowledge

config lc_judge.yaml

readme README

domain knowledge

nl2bash-equivalency.yaml · agent · train · validation · GNU General Public License v3.0

config nl2bash-equivalency.yaml

readme README

domain agent

description Short bash command generation questions with LLM-as-a-judge

value Improve foundational bash and instruction-following (IF) capabilities

Ether0 · 1 config

knowledge

ether0.yaml · knowledge · validation

config ether0.yaml

readme README

domain knowledge

description ether0 chemistry benchmark verifiers

value Evaluate chemistry knowledge and reasoning with the ether0 benchmark

Genrm Compare · 1 config

genrm_compare.yaml · (no domain listed)

config genrm_compare.yaml

readme README

Google Search · 1 config

agent

google_search.yaml · agent · train · Apache 2.0

config google_search.yaml

readme README

domain agent

description Multiple-choice question answering problems with integrated search tools

value Improve knowledge-related benchmarks with search tools

dataset Nemotron-RL-knowledge-web_search-mcqa

Instruction Following · 1 config

instruction_following

instruction_following.yaml · instruction_following · train · Apache 2.0

config instruction_following.yaml

readme README

domain instruction_following

description Instruction-following datasets targeting IFEval- and IFBench-style capabilities

value Improve performance on IFEval and IFBench

dataset Nemotron-RL-instruction_following

Jailbreak Detection · 1 config

safety

jailbreak_detection_nemotron_combined_reward_tp8.yaml · safety · validation

config jailbreak_detection_nemotron_combined_reward_tp8.yaml

readme README

domain safety

description Jailbreak detection with a Nemotron judge and a combined reward

Math Advanced Calculations · 1 config

agent

math_advanced_calculations.yaml · agent · train · Apache 2.0

config math_advanced_calculations.yaml

readme README

domain agent

description An instruction-following math environment with counter-intuitive calculators

value Improve instruction-following capabilities in specific math environments

dataset Nemotron-RL-math-advanced_calculations

Math Formal Lean · 6 configs

math

math_formal_lean.yaml · math · train · MIT

config math_formal_lean.yaml

readme README

domain math

description Lean 4 formal proof verification environment