view article Article TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell about 4 hours ago • 5
facebook/dinov3-convnext-base-pretrain-lvd1689m Image Feature Extraction • 87.6M • Updated Aug 19, 2025 • 37.3k • 10
GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms Paper • 2511.17592 • Published Nov 17, 2025 • 118
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story Paper • 2511.15210 • Published Nov 19, 2025 • 89
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published Nov 17, 2025 • 136
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published May 20, 2025 • 78
view article Article CircleGuardBench: New Standard for Evaluating AI Moderation Models May 7, 2025 • 59
When Less is Enough: Adaptive Token Reduction for Efficient Image Representation Paper • 2503.16660 • Published Mar 20, 2025 • 72
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5, 2025 • 232
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published Feb 20, 2025 • 91
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published Feb 20, 2025 • 193
Follow Anything: Open-set detection, tracking, and following in real-time Paper • 2308.05737 • Published Aug 10, 2023 • 12