view article Article TGI Multi-LoRA: Deploy Once, Serve 30 Models By derek-thomas and 2 others • Jul 18, 2024 • 60
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published 17 days ago • 61
TempFlow-GRPO: When Timing Matters for GRPO in Flow Models Paper • 2508.04324 • Published 16 days ago • 9
ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning Paper • 2508.10419 • Published 9 days ago • 68
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents Paper • 2507.22827 • Published 23 days ago • 94
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20, 2024 • 127
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory Paper • 2508.09736 • Published 9 days ago • 49
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model Paper • 2508.13009 • Published 4 days ago • 20
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published Jan 21 • 64
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Paper • 2508.14041 • Published 3 days ago • 48
Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report Paper • 2508.01059 • Published 21 days ago • 32