90 297

忍者

byteprobe

AI & ML interests

RL | NLP | LLM | LMM | agent

Recent Activity

upvoted a collection 2 days ago

upvoted an article 2 days ago

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

liked a model 2 days ago

MiniMaxAI/MiniMax-VL-01

byteprobe's activity

upvoted a collection 2 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 27 days ago • 204

upvoted an article 2 days ago

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

3 days ago

• 42

upvoted 6 articles 3 days ago

Article

Visualize and understand GPU memory in PyTorch

26 days ago

• 162

Article

Welcome the Falcon 3 Family of Open Models!

Dec 17, 2024

• 116

Article

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

Dec 16, 2024

• 82

Article

Finally, a Replacement for BERT: Introducing ModernBERT

about 1 month ago

• 487

Article

Visual Document Retrieval Goes Multilingual

9 days ago

• 57

Article

Train 400x faster Static Embedding Models with Sentence Transformers

4 days ago

• 102

upvoted 3 papers 14 days ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 54

1.58-bit FLUX

Paper • 2412.18653 • Published 25 days ago • 72

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 24 days ago • 94

upvoted 2 collections 14 days ago

Reasoning Datasets

Reasoning datasets that are trending 🔥 • 10 items • Updated 15 days ago • 23

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 12 days ago • 74

upvoted a paper 14 days ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published 18 days ago • 15

upvoted an article 15 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

16 days ago

• 37

upvoted an article 19 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Dec 4, 2024

• 76

upvoted a collection 19 days ago

Common Models

The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 28

upvoted an article 19 days ago

Article

They Said It Couldn’t Be Done

Dec 5, 2024

• 76

upvoted 2 papers 19 days ago

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Paper • 2310.08659 • Published Oct 12, 2023 • 25

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 123