The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 5 days ago • 74
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 4 days ago • 258
Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 19 days ago • 23
Article 🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 16 days ago • 37
Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • 15 days ago • 30
Article Accelerating Language Model Inference with Mixture of Attentions By hba123 • 11 days ago • 24