FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper ⢠2506.20920 ⢠Published 2 days ago ⢠23
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper ⢠2506.20920 ⢠Published 2 days ago ⢠23
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition ⢠6B ⢠Updated May 1 ⢠551k ⢠1.44k
view article Article Transformers backend integration in SGLang By marcsun13 and 4 others ⢠5 days ago ⢠35
view article Article Tiny Agents: a MCP-powered agent in 50 lines of code By julien-c ⢠Apr 25 ⢠283
view post Post 2588 Transcribing 1 hour of audio for less than $0.01 𤯠@mfuntowicz cooked with 8x faster Whisper speech recognition - whisper-large-v3-turbo transcribes at 100x real time on a $0.80/hr L4 GPU!How they did it: https://huggingface.co/blog/fast-whisper-endpoints1-click deploy with HF Inference Endpoints: https://endpoints.huggingface.co/new?repository=openai%2Fwhisper-large-v3-turbo&vendor=aws®ion=us-east&accelerator=gpu&instance_id=aws-us-east-1-nvidia-l4-x1&task=automatic-speech-recognition&no_suggested_compute=true See translation š 10 10 + Reply
openai/whisper-large-v3-turbo Automatic Speech Recognition ⢠0.8B ⢠Updated Oct 4, 2024 ⢠3.69M ⢠⢠2.46k
How Programming Concepts and Neurons Are Shared in Code Language Models Paper ⢠2506.01074 ⢠Published 26 days ago ⢠3
How Programming Concepts and Neurons Are Shared in Code Language Models Paper ⢠2506.01074 ⢠Published 26 days ago ⢠3 ⢠2
How Programming Concepts and Neurons Are Shared in Code Language Models Paper ⢠2506.01074 ⢠Published 26 days ago ⢠3