LLMs Evaluation Collection Evaluate models on key benchmarks. Thanks @clefourrier and @VictorSanh for the recommandations. • 12 items • Updated 15 days ago • 1
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper • 2502.09056 • Published 9 days ago • 30
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published 11 days ago • 57
ELAICHI Collection ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams. • 6 items • Updated Oct 24, 2024 • 6
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 18 days ago • 106
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 22 days ago • 35
Towards General-Purpose Model-Free Reinforcement Learning Paper • 2501.16142 • Published 25 days ago • 26
DolphinLabeled Datasets Collection Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated Jan 6 • 13
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation Paper • 2412.04448 • Published Dec 5, 2024 • 10
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters Paper • 2411.18197 • Published Nov 27, 2024 • 14