Florian Zimmermeister PRO

flozi00

AI & ML interests

ASR, German LLM

Recent Activity

Organizations

Training Transformers Together's profile picture Speech Recognition Community Event Version 2's profile picture A\\Ware's profile picture primeLine AI Services's profile picture ZeroGPU Explorers's profile picture primeLine Research Community's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture

flozi00's activity

liked a Space about 12 hours ago
reacted to MoritzLaurer's post with 👀 1 day ago
view post
Post
1763
Microsoft's rStar-Math paper claims that 🤏 ~7B models can match the math skills of o1 using clever train- and test-time techniques. You can now download their prompt templates from Hugging Face !

📏 The paper introduces rStar-Math, which claims to rival OpenAI o1's math reasoning capabilities by integrating Monte Carlo Tree Search (MCTS) with step-by-step verified reasoning trajectories.
🤖 A Process Preference Model (PPM) enables fine-grained evaluation of intermediate steps, improving training data quality.
🧪 The system underwent four rounds of self-evolution, progressively refining both the policy and reward models to tackle Olympiad-level math problems—without GPT-4-based data distillation.
💾 While we wait for the release of code and datasets, you can already download the prompts they used from the HF Hub!

Details and links here 👇
Prompt-templates docs: https://moritzlaurer.github.io/prompt_templates/
Templates on the hub: MoritzLaurer/rstar-math-prompts
Prompt-templates collection: MoritzLaurer/prompt-templates-6776aa0b0b8a923957920bb4
Paper: https://arxiv.org/pdf/2501.04519
upvoted an article 1 day ago
replied to their post 16 days ago