68 48 256

Amir Hossein Kargaran

kargaranamir

https://kargaranamir.github.io

AI & ML interests

#NLP, checkout https://huggingface.co/cis-lmu

Recent Activity

upvoted a paper about 6 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

authored a paper about 19 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

liked a dataset 2 days ago

microsoft/Taskbench

View all activity

Organizations

upvoted a paper about 6 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 2 days ago • 23

authored a paper about 19 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 2 days ago • 23

liked a dataset 2 days ago

microsoft/Taskbench

Viewer • Updated Aug 21, 2024 • 17.3k • 906 • 30

liked 2 models 2 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated May 1 • 551k • 1.44k

microsoft/Phi-3.5-mini-instruct

Text Generation • 4B • Updated Mar 2 • 258k • • 878

liked a model 3 days ago

PowerInfer/SmallThinker-3B-Preview

Text Generation • 3B • Updated Jan 16 • 46.2k • 401

upvoted an article 4 days ago

Article

Transformers backend integration in SGLang

and 4 others •

5 days ago

• 35

upvoted an article 5 days ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

•

Apr 25

• 283

liked a model 11 days ago

mistralai/Magistral-Small-2506

Text Generation • 24B • Updated 12 days ago • 52.3k • • 529

updated a dataset 16 days ago

kargaranamir/parallel

Updated 16 days ago • 27 • 1

liked a model 17 days ago

Qwen/Qwen3-8B

Text Generation • 8B • Updated May 21 • 1.69M • • 418

reacted to jeffboudier's post with 🚀 23 days ago

Post

2588

Transcribing 1 hour of audio for less than $0.01 🤯

@mfuntowicz cooked with 8x faster Whisper speech recognition - whisper-large-v3-turbo transcribes at 100x real time on a $0.80/hr L4 GPU!

How they did it: https://huggingface.co/blog/fast-whisper-endpoints

1-click deploy with HF Inference Endpoints: https://endpoints.huggingface.co/new?repository=openai%2Fwhisper-large-v3-turbo&vendor=aws&region=us-east&accelerator=gpu&instance_id=aws-us-east-1-nvidia-l4-x1&task=automatic-speech-recognition&no_suggested_compute=true

liked a model 23 days ago

openai/whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 3.69M • • 2.46k

updated a dataset 23 days ago

cis-lmu/glotlid-corpus

Viewer • Updated 23 days ago • 288M • 130 • 7

upvoted a paper 25 days ago

How Programming Concepts and Neurons Are Shared in Code Language Models

Paper • 2506.01074 • Published 26 days ago • 3

commented a paper 25 days ago

How Programming Concepts and Neurons Are Shared in Code Language Models

Paper • 2506.01074 • Published 26 days ago • 3 •

authored a paper 25 days ago

How Programming Concepts and Neurons Are Shared in Code Language Models

Paper • 2506.01074 • Published 26 days ago • 3

updated a dataset 26 days ago

cis-lmu/glotlid-wordlists

Viewer • Updated 26 days ago • 3.12M • 116 • 1

liked a dataset 29 days ago

kargaranamir/parallel

Updated 16 days ago • 27 • 1

published a dataset 29 days ago

kargaranamir/parallel

Updated 16 days ago • 27 • 1

Amir Hossein Kargaran

AI & ML interests

Recent Activity

Organizations

kargaranamir's activity

Transformers backend integration in SGLang

Tiny Agents: a MCP-powered agent in 50 lines of code