Training Tutorials#
We have hands-on tutorials with supported training frameworks to help you train with NeMo Gym environments. If you’re interested in integrating another training framework, see the Training Framework Integration Guide.
Tip
See Training Approaches for a refresher on when to use GRPO, SFT, or DPO.
RL (GRPO)#
Tutorial-series: GRPO training to improve multi-step tool calling on the Workplace Assistant environment, scaling from single-node to multi-node training.
Review the agent executor for using NeMo Gym environments with OpenRLHF.
GRPO training on instruction following and reasoning environments.
Coming soon
Coming soon
Multi-Environment Training#
Run multiple training environments simultaneously for rollout collection.
SFT & DPO#
Transform rollouts into training data for supervised fine-tuning (SFT) and direct preference optimization (DPO).