Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10
130
Ömer Kaya
andthattoo
Follow
erhant-fb's profile picture
Reza2kn's profile picture
batuhanaktas's profile picture
8 followers
·
8 following
https://twitter.com/andthatto
andthatto
andthattoo
AI & ML interests
Synthetic data, verifiable information retrieval
Recent Activity
updated
a model
3 days ago
driaforall/Tiny-Agent-a-1.5B
liked
a model
3 days ago
microsoft/OmniParser-v2.0
reacted
to
Kseniase
's
post
with 🔥
4 days ago
8 New Applications of Test-Time Scaling We've noticed a huge interest in test-time scaling (TTS), so we decided to explore this concept further. Test-time compute (TTC) refers to the amount of computational power used by an AI model when generating a response. Many researchers are now focused on scaling TTC, as it enables slow, deep "thinking" and step-by-step reasoning, which improves overall models' performance. Here are 8 fresh studies on test-time scaling: 1. https://huggingface.co/papers/2502.05171 Introduces an LM that scales TTC by reasoning in latent space instead of generating more tokens with no special training. Here, a recurrent block to processes information iteratively. 2. https://huggingface.co/papers/2502.04728 Shows how TTS is applied to enhance model's Planning Domain Definition Language (PDDL) reasoning capabilities, which can be used to generate a symbolic world model. 3. https://huggingface.co/papers/2502.06703 Analyzes optimal TTS strategies and shows how small models can outperform much larger ones. 4. https://huggingface.co/papers/2502.04128 Shows how TTS improves expressiveness, timbre consistency and accuracy in speech synthesis with Llasa framework. It also dives into benefits of scaling train-time compute. 5. https://huggingface.co/papers/2502.07154 Suggests a modified training loss for better reasoning of LLMs when scaling TTC. 6. https://huggingface.co/papers/2502.05078 Unifies the strengths of chain, tree, and graph paradigms into one framework that expands reasoning only on necessary subproblems. 7. https://huggingface.co/papers/2502.01839 Explores scaling trends of self-verification and how to improve its capabilities with TTC. 8. https://huggingface.co/papers/2501.14723 Explores how scaling serial compute (iterations) and parallel compute (trajectories), can improve accuracy in real-world software engineering issues. Also, explore our article about TTS for more -> https://huggingface.co/blog/Kseniase/testtimecompute
View all activity
Organizations
Articles
2
Article
5
Dria Pythonic Agent Benchmark (DPAB)
Article
23
Python Is All You Need? Introducing Dria-Agent-α
View all Articles
models
1
andthattoo/subquery-SmolLM
Text Generation
•
Updated
Aug 16, 2024
•
20
•
2
datasets
3
Sort: Recently updated
andthattoo/deepseek_reasoner
Viewer
•
Updated
15 days ago
•
300k
•
53
andthattoo/router-dpo
Viewer
•
Updated
Oct 30, 2024
•
1.92k
•
70
andthattoo/subqueries
Viewer
•
Updated
Aug 16, 2024
•
34k
•
54
•
1