dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Recent Activity

liked a dataset about 8 hours ago

SWE-bench/SWE-smith

liked a model about 8 hours ago

SWE-bench/SWE-agent-LM-32B

Organizations

None yet

huba-buba's activity

upvoted 2 papers 1 day ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published 2 days ago • 68

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published 3 days ago • 56

upvoted a collection 7 days ago

Phi-4 (All Versions)

Collection

Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes • 20 items • Updated 6 days ago • 68

upvoted 2 collections 9 days ago

Qwen3

Collection

27 items • Updated 6 days ago • 538

Qwen3

Collection

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 7 days ago • 137

upvoted a paper 14 days ago

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Paper • 2504.16078 • Published 15 days ago • 20

upvoted a paper 15 days ago

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published 16 days ago • 63

upvoted an article 19 days ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Article • Sep 18, 2024 • 242

upvoted a paper 20 days ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 22 days ago • 60

upvoted a paper 21 days ago

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 23 days ago • 84

upvoted an article 25 days ago

The NLP Course is becoming the LLM Course!

Article • Apr 3 • 90

upvoted a paper 27 days ago

Self-Steering Language Models

Paper • 2504.07081 • Published 28 days ago • 18

upvoted an article 29 days ago

SmolVLM2: Bringing Video Understanding to Every Device

Article • Feb 20 • 243

upvoted a paper 29 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published about 1 month ago • 180

upvoted a paper 30 days ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published about 1 month ago • 25

upvoted 2 papers about 1 month ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 273

Z1: Efficient Test-time Scaling with Code

Paper • 2504.00810 • Published Apr 1 • 26