Skip to content

Commit b5bc23e

Browse files
sergiopaniegoxuebwang-amd
authored andcommitted
Add TRL example notebook to RLHF docs (vllm-project#26346)
Signed-off-by: sergiopaniego <sergiopaniegoblanco@gmail.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
1 parent 49a7bf0 commit b5bc23e

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

docs/training/rlhf.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,4 +12,5 @@ See the following basic examples to get started if you don't want to use an exis
1212

1313
See the following notebooks showing how to use vLLM for GRPO:
1414

15+
- [Efficient Online Training with GRPO and vLLM in TRL](https://huggingface.co/learn/cookbook/grpo_vllm_online_training)
1516
- [Qwen-3 4B GRPO using Unsloth + vLLM](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_(4B)-GRPO.ipynb)

0 commit comments

Comments
 (0)