PERSE: Personalized 3D Generative Avatars from A Single Portrait Paper • 2412.21206 • Published Dec 30, 2024 • 19
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more… Oct 22, 2024 • 71
Phi-4 Collection Phi-4 family of small language and multi-modal models. • 7 items • Updated 6 days ago • 108
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching Paper • 2410.06885 • Published Oct 9, 2024 • 44
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published 27 days ago • 60
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 17 days ago • 49
GeoPixel Collection Pixel Grounding Large Multimodal Model in Remote Sensing • 5 items • Updated 12 days ago • 1
ArTST - Arabic Text Speech Transformer Collection Open source project for Arabic Speech Recognition and Generation • 13 items • Updated 8 days ago • 8
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 20 days ago • 30
The Ultimate Collection of Code Classifiers Collection 🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated 17 days ago • 10
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? Paper • 2502.12115 • Published 20 days ago • 42
view article Article Aurora-M: The First Open Source Biden-Harris Executive Order Red teamed Multilingual Language Model By mayank-mishra • Apr 2, 2024 • 7