OpenRLHF

community

https://github.com/OpenRLHF

AI & ML interests

None defined yet.

Recent Activity

chuyi777 updated a dataset 28 days ago

OpenRLHF/aime-2024

chuyi777 updated a dataset 28 days ago

OpenRLHF/dapo-math-17k

chuyi777 authored a paper about 1 month ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

View all activity

models 10

OpenRLHF/Llama-3-8b-rm-700k

Text Ranking • 8B • Updated Jul 28, 2025 • 90 • 3

OpenRLHF/Llama-3-8b-rm-mixture

8B • Updated Nov 30, 2024 • 24 • 1

OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt

7B • Updated Nov 30, 2024 • 5 • 1

OpenRLHF/Mistral-7b-PRM-Math-Shepherd

7B • Updated Oct 30, 2024 • 2 • 1

OpenRLHF/Llama-3-8b-iter-dpo-179k

Text Generation • 8B • Updated Jul 28, 2024 • 2

OpenRLHF/Llama-3-8b-rlhf-100k

Text Generation • 8B • Updated Jun 24, 2024 • 9 • 4

OpenRLHF/Llama-3-8b-sft-mixture

Text Generation • 8B • Updated Jun 14, 2024 • 1.18k • • 1

OpenRLHF/Llama-2-7b-sft-model-ocra-500k

Text Generation • 7B • Updated Jun 9, 2024 • 7

OpenRLHF/Llama-2-13b-rm-anthropic_hh-lmsys-oasst-webgpt

13B • Updated Jan 24, 2024 • 2

OpenRLHF/Llama-2-13b-sft-model-ocra-500k

Text Generation • 13B • Updated Jan 5, 2024 • 5 • 1

datasets 7

OpenRLHF/aime-2024

Viewer • Updated 28 days ago • 30 • 63

OpenRLHF/dapo-math-17k

Viewer • Updated 28 days ago • 17.4k • 71

OpenRLHF/gem_guess_game

Viewer • Updated Aug 30, 2025 • 2.05k • 9 • 1

OpenRLHF/prompt-collection-v0.1-dev-100k

Viewer • Updated Dec 13, 2024 • 102k • 9

OpenRLHF/preference_700K

Viewer • Updated Jul 13, 2024 • 700k • 44 • 1

OpenRLHF/prompt-collection-v0.1

Viewer • Updated Jun 14, 2024 • 179k • 251 • 6

OpenRLHF/preference_dataset_mixture2_and_safe_pku

Viewer • Updated Jun 14, 2024 • 555k • 208 • 6