mlx-community/Qwen3-30B-A3B-4bit-DWQ-05082025 Text Generation โข 5B โข Updated May 8 โข 1.99k โข 5
nvidia/Llama-3.1-Nemotron-8B-UltraLong-1M-Instruct Text Generation โข 8B โข Updated Apr 17 โข 2.66k โข 46
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Paper โข 2502.03275 โข Published Feb 5 โข 18