Shirin Yamani

ShirinYamani

shirinyamani

AI & ML interests

Core ML

ShirinYamani's activity

upvoted an article 10 days ago

Article

🐯 Liger GRPO meets TRL

and 5 others • 13 days ago • 36

commented on 🐯 Liger GRPO meets TRL 10 days ago

It usually depends on the setup: it might give a boost, or it might not.

updated a model 13 days ago

ShirinYamani/Qwen3-4B-Base-SFT

Text Generation • Updated 13 days ago • 4

published a model 13 days ago

ShirinYamani/Qwen3-4B-Base-SFT

Text Generation • Updated 13 days ago • 4

published an article 13 days ago

Article

🐯 Liger GRPO meets TRL

and 5 others • 13 days ago • 36

published a Space 13 days ago

Strl Sft

🐢

published a Space 14 days ago

SFT Job

💻

testing SFT script

updated a model about 1 month ago

trl-internal-testing/tiny-Qwen3ForCausalLM

Text Generation • Updated May 5 • 4

published a model about 1 month ago

trl-internal-testing/tiny-Qwen3ForCausalLM

Text Generation • Updated May 5 • 4

updated a dataset about 1 month ago

trl-lib/documentation-images

Viewer • Updated 5 days ago • 7 • 118k

upvoted an article about 2 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

Apr 18 • 37

upvoted a paper 3 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 128

liked 2 Spaces 3 months ago

2.66k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

Predict Memory

🧮

Calculate memory usage from model configurations

updated a model 6 months ago

ShirinYamani/Qwen2.5-0.5B-SFT-model

Updated Dec 20, 2024

updated a dataset 6 months ago

ShirinYamani/2011-2017-load

Viewer • Updated Nov 30, 2024 • 52.6k • 22

updated a model 9 months ago

ShirinYamani/chronos-t5-small-fine-tuned

Text2Text Generation • Updated Sep 5, 2024 • 9

updated a dataset 10 months ago

ShirinYamani/ts

Updated Aug 7, 2024 • 24

updated 2 models 12 months ago

ShirinYamani/llama-2-7b-fine-tuned

Updated Jun 24, 2024 • 5

ShirinYamani/huggyllama-llama-7b-finetuned

Text Generation • Updated Jun 20, 2024