archit's picture

archit

archit11

AI & ML interests

small language models, looking for work please reachout [email protected]

Recent Activity

upvoted a paper about 12 hours ago
Reinforcement Pre-Training
updated a model 1 day ago
archit11/fuchsia-grpo-finetuned-model
published a model 1 day ago
archit11/fuchsia-grpo-finetuned-model
View all activity

Organizations

Literally Me FRFR Research Society's profile picture Blog-explorers's profile picture ZeroGPU Explorers's profile picture IndiaBuild's profile picture Hugging Face Discord Community's profile picture LeRobot Worldwide Hackathon's profile picture Hugging Face MCP Course's profile picture Agents-MCP-Hackathon's profile picture

archit11's activity

upvoted an article 2 months ago
view article
Article

Enabling Long Context Training with Sequence Parallelism in Axolotl

By axolotl-ai-co and 1 other β€’
β€’ 8
upvoted an article 3 months ago
view article
Article

SigLIP 2: A better multilingual vision language encoder

By ariG23498 and 2 others β€’
β€’ 165
upvoted an article 4 months ago
view article
Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

By Pclanglais β€’
β€’ 30
upvoted 2 articles 4 months ago
view article
Article

How to deploy and fine-tune DeepSeek models on AWS

By pagezyhf and 2 others β€’
β€’ 52
view article
Article

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

By davanstrien β€’
β€’ 8
upvoted an article 5 months ago
view article
Article

Train 400x faster Static Embedding Models with Sentence Transformers

By tomaarsen β€’
β€’ 187
upvoted 2 articles 7 months ago