grib0ed0v (Alexey G)

upvoted a collection 4 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 586

upvoted an article 5 months ago

Article

The Open Arabic LLM Leaderboard 2

By

and 7 others •

Feb 10

• 33

upvoted 8 articles 6 months ago

Article

Welcome to Inference Providers on the Hub 🔥

By

and 6 others •

Jan 28

• 484

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

By

and 2 others •

Jan 23

• 182

Article

Train 400x faster Static Embedding Models with Sentence Transformers

By

•

Jan 15

• 197

Article

Introducing smolagents: simple agents that write actions in code.

By

and 2 others •

Dec 31, 2024

• 1.09k

Article

We now support VLMs in smolagents!

By

and 2 others •

Jan 24

• 107

Article

Finally, a Replacement for BERT: Introducing ModernBERT

By

and 14 others •

Dec 19, 2024

• 670

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

By

and 1 other •

Jan 16

• 75

Article

Timm ❤️ Transformers: Use any timm model with transformers

By

and 4 others •

Jan 16

• 50

upvoted 2 collections 8 months ago

SigLIP

Collection

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 17 days ago • 58

Cultura-Ru-Edu

Collection

Our dataset for enhancing LLM training with educational content in the Russian language. • 2 items • Updated Nov 26, 2024 • 5

upvoted 2 papers 8 months ago

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 19

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 70

upvoted an article 8 months ago

Article

Let’s make a generation of amazing image generation models

By

and 4 others •

Nov 26, 2024

• 33

upvoted a paper 8 months ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 56

upvoted 4 papers 9 months ago

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 67

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2, 2024 • 12

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25, 2024 • 18

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 21

Alexey G

AI & ML interests

Organizations

Llama 4

The Open Arabic LLM Leaderboard 2

Welcome to Inference Providers on the Hub 🔥

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Train 400x faster Static Embedding Models with Sentence Transformers

Introducing smolagents: simple agents that write actions in code.

We now support VLMs in smolagents!

Finally, a Replacement for BERT: Introducing ModernBERT

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Timm ❤️ Transformers: Use any timm model with transformers

SigLIP

Cultura-Ru-Edu

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Attention Is All You Need

Let’s make a generation of amazing image generation models

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Large Language Models Can Self-Improve in Long-context Reasoning

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Alexey G

AI & ML interests

Organizations

grib0ed0v's activity

The Open Arabic LLM Leaderboard 2

Welcome to Inference Providers on the Hub 🔥

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Train 400x faster Static Embedding Models with Sentence Transformers

Introducing smolagents: simple agents that write actions in code.

We now support VLMs in smolagents!

Finally, a Replacement for BERT: Introducing ModernBERT

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Timm ❤️ Transformers: Use any timm model with transformers

Let’s make a generation of amazing image generation models