Tushar Gupta

tusharg92

tusg

AI & ML interests

NLP, deep learning, machine learning

Recent Activity

upvoted an article 28 days ago

🪆 Introduction to Matryoshka Embedding Models

upvoted an article 28 days ago

Transformers backend integration in SGLang

upvoted an article 5 months ago

SmolVLM2: Bringing Video Understanding to Every Device

View all activity

Organizations

None yet

upvoted 2 articles 28 days ago

Article

🪆 Introduction to Matryoshka Embedding Models

and 2 others •

Feb 23, 2024

• 149

Article

Transformers backend integration in SGLang

and 4 others •

about 1 month ago

• 48

upvoted 2 articles 5 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

and 6 others •

Feb 20

• 285

Article

1 Billion Classifications

•

Feb 13

• 43

upvoted an article 6 months ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others •

Feb 4

• 1.28k

upvoted a paper 9 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 180

upvoted 2 papers 10 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 152

8-bit Optimizers via Block-wise Quantization

Paper • 2110.02861 • Published Oct 6, 2021 • 2

upvoted a collection 10 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 2 days ago • 627

upvoted 2 articles 10 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

and 5 others •

Sep 18, 2024

• 261

Article

Accelerate 1.0.0

and 2 others •

Sep 13, 2024

• 53

upvoted a paper 11 months ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 42

upvoted a paper about 1 year ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 137

upvoted an article about 1 year ago

Article

Welcome Gemma 2 - Google's new open LLM

and 5 others •

Jun 27, 2024

• 130

upvoted 2 collections about 1 year ago

Jina Reranker v2

Collection

A collection of state-of-the-art multilingual neural rerankers • 1 item • Updated 3 days ago • 9

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 2 days ago • 368

Tushar Gupta

AI & ML interests

Recent Activity

Organizations

tusharg92's activity

🪆 Introduction to Matryoshka Embedding Models

Transformers backend integration in SGLang

SmolVLM2: Bringing Video Understanding to Every Device

1 Billion Classifications

Open-source DeepResearch – Freeing our search agents

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Accelerate 1.0.0

Welcome Gemma 2 - Google's new open LLM