Sweker

Swekerr

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago

Swekerr/toxy-smollm2-360m-sft-v1.5

published a model 2 days ago

Swekerr/toxy-smollm2-360m-sft-v1.5

updated a model 8 days ago

Swekerr/toxy-smollm2-360m-sft-v1.0

Organizations

updated a model 2 days ago

Swekerr/toxy-smollm2-360m-sft-v1.5

Text Generation • 0.4B • Updated 2 days ago • 17

published a model 2 days ago

Swekerr/toxy-smollm2-360m-sft-v1.5

Text Generation • 0.4B • Updated 2 days ago • 17

updated a model 8 days ago

Swekerr/toxy-smollm2-360m-sft-v1.0

Text Generation • 0.4B • Updated 8 days ago • 15

published a model 8 days ago

Swekerr/toxy-smollm2-360m-sft-v1.0

Text Generation • 0.4B • Updated 8 days ago • 15

updated a model 14 days ago

Swekerr/toxy-gemma3-270m-sft-v1.0

Text Generation • 0.3B • Updated 14 days ago • 34

published a model 14 days ago

Swekerr/toxy-gemma3-270m-sft-v1.0

Text Generation • 0.3B • Updated 14 days ago • 34

upvoted 2 articles about 2 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others • Jul 8 • 646

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

and 1 other • Jul 9 • 666

upvoted an article 2 months ago

Article

Announcing NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models

and 8 others • Jul 4 • 9

liked a Space 3 months ago

Planthy

🌿

a plant health monitoring app

updated a Space 3 months ago

Planthy

🌿

a plant health monitoring app

published a Space 3 months ago

Planthy

🌿

a plant health monitoring app

upvoted an article 3 months ago

Article

🐯 Liger GRPO meets TRL

and 5 others • May 25 • 49

upvoted a paper 4 months ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4, 2024 • 101

upvoted 3 articles 4 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

and 6 others • May 21 • 209

Article

The Transformers Library: standardizing model definitions

and 3 others • May 15 • 117

Article

Vision Language Models Explained

and 1 other • Apr 11, 2024 • 444

upvoted a paper 4 months ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29 • 93

upvoted an article 4 months ago

Article

Train your first Decision Transformer

and 1 other • Sep 8, 2022 • 14

upvoted an article 5 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7 • 211
