Magpie Speech — Applying an LLM Data Synthesis Method to an LLM-Based TTS Model to Synthesize a Speech Dataset By Aratako • 6 days ago • 5
GitChameleon 2.0: Evaluating AI Code Generation Against Python Library Version Incompatibilities By cabbage972 • 6 days ago
Protocolic Media: Structured Intelligence and the Future of Cognitive Environments By kanaria007 • 7 days ago
Decoding the Shift and Diffusion Models Training Like Qwen Image, FLUX, SDXL, and More By MonsterMMORPG • 7 days ago • 1
Bridging the Gap: Making Robotics Feel Like Machine Learning By hba123 • 8 days ago • 12
NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks By nvidia and 4 others • 8 days ago • 56
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation By Alibaba-DAMO-Academy and 9 others • 9 days ago • 25
Luth: Efficient French Specialization for Small Language Models By MaxLSB and 1 other • 9 days ago • 9