Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10
130
Ömer Kaya
andthattoo
Follow
KvnMln's profile picture
batuhanaktas's profile picture
Reza2kn's profile picture
8 followers
·
9 following
https://twitter.com/andthatto
andthatto
andthattoo
AI & ML interests
Synthetic data, verifiable information retrieval
Recent Activity
updated
a model
4 days ago
driaforall/Tiny-Agent-a-1.5B
liked
a model
4 days ago
microsoft/OmniParser-v2.0
reacted
to
Kseniase
's
post
with 🔥
5 days ago
8 New Applications of Test-Time Scaling We've noticed a huge interest in test-time scaling (TTS), so we decided to explore this concept further. Test-time compute (TTC) refers to the amount of computational power used by an AI model when generating a response. Many researchers are now focused on scaling TTC, as it enables slow, deep "thinking" and step-by-step reasoning, which improves overall models' performance. Here are 8 fresh studies on test-time scaling: 1. https://huggingface.co/papers/2502.05171 Introduces an LM that scales TTC by reasoning in latent space instead of generating more tokens with no special training. Here, a recurrent block to processes information iteratively. 2. https://huggingface.co/papers/2502.04728 Shows how TTS is applied to enhance model's Planning Domain Definition Language (PDDL) reasoning capabilities, which can be used to generate a symbolic world model. 3. https://huggingface.co/papers/2502.06703 Analyzes optimal TTS strategies and shows how small models can outperform much larger ones. 4. https://huggingface.co/papers/2502.04128 Shows how TTS improves expressiveness, timbre consistency and accuracy in speech synthesis with Llasa framework. It also dives into benefits of scaling train-time compute. 5. https://huggingface.co/papers/2502.07154 Suggests a modified training loss for better reasoning of LLMs when scaling TTC. 6. https://huggingface.co/papers/2502.05078 Unifies the strengths of chain, tree, and graph paradigms into one framework that expands reasoning only on necessary subproblems. 7. https://huggingface.co/papers/2502.01839 Explores scaling trends of self-verification and how to improve its capabilities with TTC. 8. https://huggingface.co/papers/2501.14723 Explores how scaling serial compute (iterations) and parallel compute (trajectories), can improve accuracy in real-world software engineering issues. Also, explore our article about TTS for more -> https://huggingface.co/blog/Kseniase/testtimecompute
View all activity
Organizations
andthattoo
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
4 days ago
microsoft/OmniParser-v2.0
Image-Text-to-Text
•
Updated
3 days ago
•
3.58k
•
802
liked
a model
5 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
•
Updated
12 days ago
•
1.08M
•
•
901
liked
a dataset
7 days ago
AI-MO/NuminaMath-1.5
Viewer
•
Updated
11 days ago
•
896k
•
1.85k
•
106
liked
a model
10 days ago
intfloat/multilingual-e5-large-instruct
Feature Extraction
•
Updated
5 days ago
•
476k
•
•
331
liked
a dataset
11 days ago
driaforall/verifiable-pythonic-function-calling-lite
Viewer
•
Updated
14 days ago
•
16.4k
•
184
•
5
liked
a Space
15 days ago
Running
on
A10G
1.23k
1.23k
GGUF My Repo
🦙
liked
a dataset
16 days ago
simplescaling/s1K
Viewer
•
Updated
11 days ago
•
1k
•
4.59k
•
175
liked
2 datasets
17 days ago
adyen/DABstep
Viewer
•
Updated
11 days ago
•
10.4k
•
2.38k
•
9
TIGER-Lab/AceCode-87K
Viewer
•
Updated
13 days ago
•
87.1k
•
809
•
31
liked
2 datasets
23 days ago
cognitivecomputations/dolphin-r1
Viewer
•
Updated
22 days ago
•
814k
•
5.35k
•
262
nisten/all-human-diseases
Viewer
•
Updated
Aug 19, 2024
•
2.2k
•
160
•
106
liked
a dataset
25 days ago
driaforall/pythonic-function-calling
Viewer
•
Updated
15 days ago
•
81.8k
•
498
•
19
liked
a model
25 days ago
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
Updated
6 days ago
•
1.45M
•
510
liked
a model
26 days ago
Qwen/Qwen2.5-7B-Instruct-1M
Text Generation
•
Updated
23 days ago
•
289k
•
235
liked
a model
27 days ago
dnhkng/RYS-XLarge
Text Generation
•
Updated
Oct 11, 2024
•
2.06k
•
85
liked
a dataset
27 days ago
THUDM/ComplexFuncBench
Updated
about 1 month ago
•
199
•
3
liked
a model
28 days ago
nvidia/Llama-3.1-Nemotron-70B-Reward
Updated
Oct 15, 2024
•
40
•
71
liked
3 models
about 1 month ago
qresearch/Llama-3.2-1B-Instruct-SAE-l9
Updated
about 1 month ago
•
13
Qwen/Qwen2.5-Coder-1.5B
Text Generation
•
Updated
Nov 18, 2024
•
10.3k
•
46
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
12 days ago
•
4.35M
•
•
9.85k
Load more