OWLS: Scaling Laws for Speech Recognition and Translation Collection 🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate. • 8 items • Updated May 3 • 7
SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks Paper • 2501.11599 • Published Jan 20 • 1
Article Perceiver IO: a scalable, fully-attentional model that works on any modality By nielsr • Dec 15, 2021 • 9
Article Spread Your Wings: Falcon 180B is here By philschmid and 4 others • Sep 6, 2023 • 7
Article 🤗 PEFT welcomes new merging methods By smangrul and 1 other • Feb 19, 2024 • 19
Article Mixture of Experts Explained By osanseviero and 5 others • Dec 11, 2023 • 678
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 13 days ago • 148