Running 532 532 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
view article Article How to build a custom text classifier without days of human labeling By sdiazlor and 4 others • Oct 17, 2024 • 55
view post Post 5540 Multimodal Ichigo Llama 3.1 - Real Time Voice AI 🔥> WhisperSpeech X Llama 3.1 8B> Trained on 50K hours of speech (7 languages)> Continually trained on 45hrs 10x A1000s> MLS -> WhisperVQ tokens -> Llama 3.1> Instruction tuned on 1.89M samples> 70% speech, 20% transcription, 10% text> Apache 2.0 licensed ⚡Architecture:> WhisperSpeech/ VQ for Semantic Tokens> Llama 3.1 8B Instruct for Text backbone> Early fusion (Chameleon)I'm super bullish on HomeBrew/ Jan and early fusion, audio and text, multimodal models!(P.S. Play with the demo on Hugging Face: jan-hq/Ichigo-llama3.1-s-instruct) 🔥 16 16 👍 5 5 ❤️ 2 2 😎 1 1 👀 1 1 🚀 1 1 + Reply