M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449
Junxiong Wang PRO
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
new activity
3 days ago
togethercomputer/Aurora-Spec-Minimax-M2.1:is there a FP8 version? updated
a model 26 days ago
togethercomputer/Aurora-Spec-Minimax-M2.1 updated
a model 29 days ago
togethercomputer/Aurora-Spec-Qwen3-Coder-Next-FP8