Bread's picture

Bread

kimleang123

·

AI & ML interests

NLP && Computer vision && Generative AI

Recent Activity

liked a model about 23 hours ago

Qwen/Qwen3-Reranker-0.6B

liked a model about 23 hours ago

Qwen/Qwen3-Embedding-0.6B

liked a dataset 1 day ago

rinabuoy/Khmer-Flick3k

View all activity

Organizations

kimleang123's activity

upvoted an article 4 days ago

Article

Google Gemini Diffusion: What's It About?

By

•

16 days ago

• 5

upvoted a collection 24 days ago

BGE

30 items • Updated 18 days ago • 119

upvoted an article 29 days ago

Article

🪆 Introduction to Matryoshka Embedding Models

By

and 2 others •

Feb 23, 2024

• 124

upvoted a collection about 1 month ago

Qwen3

40 items • Updated 17 days ago • 738

upvoted 2 articles about 1 month ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

By

•

Apr 25

• 267

Article

Merge Large Language Models with mergekit

By

•

Jan 9, 2024

• 120

upvoted a paper about 1 month ago

Training Sparse Mixture Of Experts Text Embedding Models

Paper • 2502.07972 • Published Feb 11 • 7

upvoted an article about 2 months ago

Article

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

By

•

Jul 9, 2024

• 55

upvoted an article 2 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

By

•

Mar 26

• 134

upvoted a collection 2 months ago

Quantized Qwen2.5

9 items • Updated Dec 9, 2024 • 4

upvoted a paper 3 months ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 22

upvoted a collection 3 months ago

QwQ

Qwen with Questions • 6 items • Updated Apr 28 • 95

upvoted a paper 3 months ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 120

upvoted 2 articles 4 months ago

Article

Decoding Strategies in Large Language Models

By

•

Oct 29, 2024

• 66

Article

Welcome fastText to the 🤗 Hub

By

and 1 other •

Jun 6, 2023

• 3

upvoted 2 papers 4 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 153

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10 • 30

upvoted a collection 4 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 266

upvoted an article 4 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 862

upvoted an article 5 months ago

Article

The Large Language Model Course

By

•

Jan 16

• 185