🔄 In a Training Loop
Gurvaah Singh
ReallyFloppyPenguin
AI & ML interests
AI, GGUFing AI, AI, Running AI, Thinking about AI, and so on
Recent Activity
liked a model 4 days ago: JBrussee/gemma-4-31B-caveman-lora
liked a Space 6 days ago: chopratejas/kompress-v2-base-demo
liked a Space 9 days ago: HuggingFaceTB/smol-training-playbook
Organizations
Datasets That Kill
Sikh Models
- HuggingFaceTB/SmolLM3-3B
  Text Generation • 3B • 686k downloads • 981 likes
- Qwen/Qwen3-4B
  Text Generation • 4B • 12M downloads • 644 likes
- meta-llama/Llama-3.1-8B-Instruct
  Text Generation • 8B • 9.69M downloads • 6.21k likes
- mistralai/Mistral-7B-Instruct-v0.3
  7B • 3.59M downloads • 2.68k likes
GGUFs
Interesting Papers
- Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
  Paper • 2501.11425 • 108 upvotes
- Agent Laboratory: Using LLM Agents as Research Assistants
  Paper • 2501.04227 • 95 upvotes
- System Prompt Optimization with Meta-Learning
  Paper • 2505.09666 • 72 upvotes
- Visual Planning: Let's Think Only with Images
  Paper • 2505.11409 • 57 upvotes
MathRL
Note: The solution may not be in the `solution` or `answer` columns; it may instead appear inside `\boxed{ANSWER}`.
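As a rough illustration of the note above, here is a minimal sketch of pulling a final answer out of a boxed expression. It assumes the note refers to the LaTeX `\boxed{...}` convention common in math datasets. The function name `extract_boxed` is hypothetical, and the page does not say which dataset field holds the boxed text, so the caller has to supply it.

```python
from typing import Optional


def extract_boxed(text: str) -> Optional[str]:
    r"""Return the contents of the last \boxed{...} in text, or None.

    Braces are counted so that nested groups such as \frac{1}{2} survive.
    """
    marker = r"\boxed{"
    start = text.rfind(marker)
    if start == -1:
        return None  # no boxed answer present

    depth = 1  # we are already inside the opening brace of \boxed{
    collected = []
    for ch in text[start + len(marker):]:
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                return "".join(collected)
        collected.append(ch)
    return None  # braces never balanced, treat as missing


if __name__ == "__main__":
    print(extract_boxed(r"Therefore the result is \boxed{\frac{1}{2}}."))
```

The last occurrence is used because worked solutions often box intermediate results before the final one; that choice is an assumption and may not suit every dataset.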
Free AI!!!
Revolutionary Models
Ultra Cool Models