Article Train 400x faster Static Embedding Models with Sentence Transformers 15 days ago • 128
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 134
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 642
FlashSpeech: Efficient Zero-Shot Speech Synthesis Paper • 2404.14700 • Published Apr 23, 2024 • 31
Proactive Detection of Voice Cloning with Localized Watermarking Paper • 2401.17264 • Published Jan 30, 2024 • 18
Masked Audio Generation using a Single Non-Autoregressive Transformer Paper • 2401.04577 • Published Jan 9, 2024 • 43
Pheme: Efficient and Conversational Speech Generation Paper • 2401.02839 • Published Jan 5, 2024 • 18
CoMoSVC: Consistency Model-based Singing Voice Conversion Paper • 2401.01792 • Published Jan 3, 2024 • 11
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 259
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit Paper • 2312.09911 • Published Dec 15, 2023 • 54
StemGen: A music generation model that listens Paper • 2312.08723 • Published Dec 14, 2023 • 48
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning Paper • 2312.06134 • Published Dec 11, 2023 • 3
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration Paper • 2311.04257 • Published Nov 7, 2023 • 21
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis Paper • 2312.03491 • Published Dec 6, 2023 • 34
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models Paper • 2312.03632 • Published Dec 6, 2023 • 5
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 140
Merlin:Empowering Multimodal LLMs with Foresight Minds Paper • 2312.00589 • Published Nov 30, 2023 • 25