Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 5 days ago • 68
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 3 days ago • 39
5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub By fdaudens and 1 other • 7 days ago • 19
We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 ✌️✌️✌️ By pollen-robotics and 2 others • 14 days ago • 30
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 12 days ago • 43
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 189
LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs By davidberenstein1957 and 3 others • 20 days ago • 14
🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation By moonshotai and 1 other • about 1 month ago • 65
Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization By TuringsSolutions • Feb 8, 2024 • 14
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 54
Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 5 days ago • 68
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 3 days ago • 39
5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub By fdaudens and 1 other • 7 days ago • 19
We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 ✌️✌️✌️ By pollen-robotics and 2 others • 14 days ago • 30
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 12 days ago • 43
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 189
LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs By davidberenstein1957 and 3 others • 20 days ago • 14
🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation By moonshotai and 1 other • about 1 month ago • 65
Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization By TuringsSolutions • Feb 8, 2024 • 14
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 54