忍者's picture

忍者

byteprobe

AI & ML interests

RL | NLP | LLM | LMM | agent

Recent Activity

Organizations

LocalLLaMA's profile picture MLX Community's profile picture Hugging Face 1Bit LLMs's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture

byteprobe's activity

upvoted an article 2 days ago
view article
Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

42
upvoted 6 articles 3 days ago
view article
Article

Visualize and understand GPU memory in PyTorch

162
view article
Article

Welcome the Falcon 3 Family of Open Models!

116
view article
Article

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

82
view article
Article

Finally, a Replacement for BERT: Introducing ModernBERT

487
view article
Article

Visual Document Retrieval Goes Multilingual

57
view article
Article

Train 400x faster Static Embedding Models with Sentence Transformers

102
upvoted an article 15 days ago
view article
Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

By wolfram
37
upvoted an article 19 days ago
view article
Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By wolfram
76
upvoted an article 19 days ago