27 21 58

Haris Jabbar PRO

maveriq

AI & ML interests

Tokenization, language generation, normalizing flows, language modeling, document ai

Recent Activity

upvoted a paper about 1 month ago

TTRL: Test-Time Reinforcement Learning

liked a dataset about 2 months ago

Salesforce/xlam-function-calling-60k

upvoted an article 2 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

View all activity

Organizations

maveriq's activity

upvoted a paper about 1 month ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 112

liked a dataset about 2 months ago

Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24 • 60k • 3.8k • 459

upvoted an article 2 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 148

updated a dataset 3 months ago

spd-dev/codetest

Viewer • Updated Mar 21 • 124 • 37

published a dataset 3 months ago

spd-dev/codetest

Viewer • Updated Mar 21 • 124 • 37

liked a dataset 3 months ago

reasoning-course/certificates

Viewer • Updated about 15 hours ago • 246 • 349 • 2

New activity in huggingface/HuggingDiscussions 3 months ago

[FEEDBACK] Inference Providers

❤️ 17

113

#49 opened 5 months ago by

julien-c

upvoted an article 4 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

and 5 others •

Feb 4

• 89

upvoted a paper 4 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 153

upvoted an article 4 months ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others •

Feb 4

• 1.25k

liked a Space 4 months ago

DABstep Leaderboard

🕺

DABstep Reasoning Benchmark Leaderboard

liked a dataset 6 months ago

HuggingFaceTB/finemath

Viewer • Updated Feb 6 • 48.3M • 20.3k • 314

upvoted a collection 6 months ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 24

upvoted a paper 6 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 63

liked a Space 6 months ago

567

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute