rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper β’ 2501.04519 β’ Published 10 days ago β’ 230
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 16 days ago β’ 37
Upcycling Large Language Models into Mixture of Experts Paper β’ 2410.07524 β’ Published Oct 10, 2024 β’ 4
view article Article The Great LLM Showdown: Amy's Quest for the Perfect LLM By wolfram β’ Jul 9, 2024 β’ 13
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper β’ 2404.03715 β’ Published Apr 4, 2024 β’ 61
DiJiang: Efficient Large Language Models through Compact Kernelization Paper β’ 2403.19928 β’ Published Mar 29, 2024 β’ 11