Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • Jun 4, 2024 • 79
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges Paper • 2505.10468 • Published May 15 • 9
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking Paper • 2505.08581 • Published May 13 • 9
Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning Paper • 2505.09738 • Published May 14 • 9
Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper • 2505.10558 • Published May 15 • 15
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper • 2505.09990 • Published May 15 • 12
J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning Paper • 2505.10320 • Published May 15 • 23
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published May 15 • 120
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 487
Pleias-RAG Collection New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 27
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 1 day ago • 52
Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29, 2024 • 350
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 236
Llama 3.3 Collection This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 176
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis Paper • 2410.23320 • Published Oct 30, 2024 • 8