MrBERT
BSC-LT/MrBERT • Fill-Mask • 0.3B • Updated Mar 26 • 1.04k • 8
BSC-LT/MrBERT-es • Fill-Mask • 0.2B • Updated Apr 9 • 5.3k • 7
BSC-LT/MrBERT-ca • Fill-Mask • 0.1B • Updated Apr 21 • 26 • 2
BSC-LT/MrBERT-legal • Fill-Mask • 0.3B • Updated Apr 9 • 206 • 1

Salamandra 🦎
BSC-LT/salamandra-7b-instruct • Text Generation • 8B • Updated Oct 22, 2025 • 117k • 79
BSC-LT/salamandra-7b • Text Generation • 8B • Updated Oct 22, 2025 • 907 • 29
BSC-LT/salamandra-2b-instruct • Text Generation • 2B • Updated Oct 22, 2025 • 3.21k • 28
BSC-LT/salamandra-2b • Text Generation • 2B • Updated Oct 22, 2025 • 2k • 25

Speech models
Models developed by the speech team of the Language Technologies unit.
BSC-LT/wavenext-encodec • Updated Sep 12, 2024 • 4
BSC-LT/wavenext-mel • Updated Sep 10, 2024 • 8 • 3
BSC-LT/vocos-mel-22khz • Updated Aug 27, 2024 • 870 • 7
BSC-LT/whisper-large-v3-ca-punctuated-3370h • Automatic Speech Recognition • Updated Oct 28, 2025 • 144 • 1

ALIA
BSC-LT/ALIA-40b-instruct-2601 • Text Generation • 40B • Updated Feb 19 • 15.1k • 12
BSC-LT/ALIA-40b-instruct-2601-GGUF • Text Generation • 40B • Updated Feb 20 • 138 • 4

MT Datasets
BSC-LT/Legal_Catalan_Spanish_Parallel_Corpus • Updated 6 days ago • 25
BSC-LT/MULTI_corpus • Updated 6 days ago • 468k • 31
BSC-LT/geneval_catalan • Updated Apr 9 • 5.25k • 378
BSC-LT/NTEU_Multilingual_Evaluation_Dataset • Updated Nov 4, 2025 • 88 • 1

Speech datasets
Datasets curated by the speech team of the Language Technologies unit.
BSC-LT/CAESAR-TINY • Updated Apr 7, 2025 • 667 • 28
BSC-LT/CAESAR-TV3 • Updated Apr 7, 2025 • 2.96k • 70 • 1
BSC-LT/BSCs_Code_Switching_CA-ES_ASR_Test • Updated Nov 22, 2025 • 14
BSC-LT/distilled-yodas-spanish • Updated Dec 15, 2025 • 181 • 3