Article I trained a Language Model to schedule events with GRPO! By anakin87 • 18 days ago • 70
Article Welcoming Llama Guard 4 on Hugging Face Hub By merve and 3 others • 18 days ago • 35
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks Paper • 2505.00234 • Published 16 days ago • 22
VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation Paper • 2412.10151 • Published Dec 13, 2024 • 7
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean Paper • 2403.10882 • Published Mar 16, 2024 • 6
ORPO Collection This is the official collection of "ORPO: Monolithic Preference Optimization without Reference Model". • 5 items • Updated Apr 12, 2024 • 11