Amir Hossein Kargaran's picture

Amir Hossein Kargaran

kargaranamir

·

https://kargaranamir.github.io

AI & ML interests

#NLP, checkout https://huggingface.co/cis-lmu

Recent Activity

upvoted a paper about 6 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

authored a paper about 19 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

liked a dataset 2 days ago

microsoft/Taskbench

View all activity

Organizations

upvoted a paper about 6 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 2 days ago • 23

upvoted an article 4 days ago

Article

Transformers backend integration in SGLang

By

and 4 others •

5 days ago

• 35

upvoted an article 5 days ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

By

•

Apr 25

• 283

upvoted a paper 25 days ago

How Programming Concepts and Neurons Are Shared in Code Language Models

Paper • 2506.01074 • Published 26 days ago • 3

upvoted a paper about 1 month ago

Tracing Multilingual Factual Knowledge Acquisition in Pretraining

Paper • 2505.14824 • Published May 20 • 4

upvoted a paper about 2 months ago

Multilingual k-Nearest-Neighbor Machine Translation

Paper • 2310.14644 • Published Oct 23, 2023 • 2

upvoted a collection 3 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Apr 28 • 624

upvoted 2 papers 3 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 235

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 292

upvoted 2 collections 3 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 553

— UI is a good thing 💅 —

cool spaces with a cool UI, what could be better? • 5 items • Updated May 5 • 20

upvoted a paper 3 months ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 52

upvoted a paper 4 months ago

On Relation-Specific Neurons in Large Language Models

Paper • 2502.17355 • Published Feb 24 • 9

upvoted a collection 4 months ago

MMTEB

Our contribution to the Massive Multilingual Text Embedding Benchmark (MMTEB). Retrieval and reranking benchmarks in 16 languages. • 4 items • Updated Jun 6, 2024 • 3

upvoted a paper 4 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 36

upvoted a collection 4 months ago

CommonCrawl

Large web-mined general corpus based on CommonCrawl. • 8 items • Updated Apr 13 • 3

upvoted a paper 4 months ago

NoLiMa: Long-Context Evaluation Beyond Literal Matching

Paper • 2502.05167 • Published Feb 7 • 15

upvoted 2 articles 5 months ago

Article

Open-R1: Update #1

By

and 7 others •

Feb 2

• 305

Article

Uncensor any LLM with abliteration

By

•

Jun 13, 2024

• 618

upvoted an article 7 months ago

Article

Finding Moroccan Arabic (Darija) in Fineweb 2

By

and 3 others •

Dec 8, 2024

• 23