Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 61
Running 2.56k 2.56k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness Paper • 2205.14135 • Published May 27, 2022 • 13
open-llm-leaderboard/Qwen__Qwen2.5-Math-7B-Instruct-details Viewer • Updated Feb 13 • 43.2k • 114 • 1
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods Jan 18, 2024 • 57