MrBERT
BSC-LT/MrBERT • Fill-Mask • 0.3B • Updated Mar 26 • 282 • 8
BSC-LT/MrBERT-es • Fill-Mask • 0.2B • Updated Apr 9 • 2.24k • 9
BSC-LT/MrBERT-ca • Fill-Mask • 0.1B • Updated Apr 21 • 50 • 2
BSC-LT/MrBERT-legal • Fill-Mask • 0.3B • Updated Apr 9 • 50 • 1

Salamandra 🦎
BSC-LT/salamandra-7b-instruct • Text Generation • 8B • Updated Oct 22, 2025 • 18.1k • 79
BSC-LT/salamandra-7b • Text Generation • 8B • Updated Oct 22, 2025 • 957 • 30
BSC-LT/salamandra-2b-instruct • Text Generation • 2B • Updated Oct 22, 2025 • 5.67k • 28
BSC-LT/salamandra-2b • Text Generation • 2B • Updated Oct 22, 2025 • 1.63k • 27

Speech models
Models developed by the speech team of the Language Technologies unit.
BSC-LT/wavenext-encodec • Updated Sep 12, 2024 • 4
BSC-LT/wavenext-mel • Updated Sep 10, 2024 • 13 • 3
BSC-LT/vocos-mel-22khz • Updated Aug 27, 2024 • 719 • 7
BSC-LT/whisper-large-v3-ca-punctuated-3370h • Automatic Speech Recognition • Updated Oct 28, 2025 • 78 • 1

ALIA
BSC-LT/ALIA-40b-instruct-2601 • Text Generation • 40B • Updated 22 days ago • 695 • 12
BSC-LT/ALIA-40b-instruct-2601-GGUF • Text Generation • 40B • Updated Feb 20 • 109 • 5

MT Datasets
BSC-LT/BSC_ParaMT_8 • Viewer • Updated 8 days ago • 733M • 99
BSC-LT/Legal_Catalan_Spanish_Parallel_Corpus • Updated 30 days ago • 49
BSC-LT/MULTI_corpus • Viewer • Updated 30 days ago • 468k • 61
BSC-LT/geneval_catalan • Viewer • Updated Apr 9 • 5.25k • 93

Speech datasets
Datasets curated by the speech team of the Language Technologies unit.
BSC-LT/CAESAR-TINY • Viewer • Updated Apr 7, 2025 • 667 • 17
BSC-LT/CAESAR-TV3 • Viewer • Updated Apr 7, 2025 • 2.96k • 33 • 1
BSC-LT/BSCs_Code_Switching_CA-ES_ASR_Test • Updated Nov 22, 2025 • 26
BSC-LT/distilled-yodas-spanish • Updated Dec 15, 2025 • 62 • 3