facebook/wav2vec2-xls-r-1b

xls_r_pretrained

aconneau committed on Nov 16, 2021

Commit 6b29317 · 1 Parent(s): 8cb55d3

Update README.md

Files changed (1)

README.md +2 -2

@@ -11,6 +11,8 @@ license: apache-2.0
 # Wav2Vec2-XLS-R-1B
+![model image](https://raw.githubusercontent.com/patrickvonplaten/scientific_images/master/xls_r.png)
 [Facebook's Wav2Vec2 XLS-R](https://ai.facebook.com/blog/wav2vec-20-learning-the-structure-of-speech-from-raw-audio/)
 XLS-R is Facebook AI's large-scale multilingual pretrained model for speech (the "XLM-R for Speech"). It is pretrained on 436k hours of unlabeled speech, including VoxPopuli, MLS, CommonVoice, BABEL and VoxLingua107. Is uses the wav2vec 2.0 objective, in 128 languages. When using the model make sure that your speech input is sampled at 16Khz. Note that this model should be fine-tuned on a downstream task, like Automatic Speech Recognition, Translation or Classification. Check out [this blog](https://huggingface.co/blog/fine-tune-wav2vec2-english) for more information about ASR.
@@ -34,5 +36,3 @@ You can find other pretrained XLS-R models with different numbers of parameters:
 * [1B version version](https://huggingface.co/facebook/wav2vec2-xls-r-1b)
 * [2B version version](https://huggingface.co/facebook/wav2vec2-xls-r-2b)
-![model image](https://raw.githubusercontent.com/patrickvonplaten/scientific_images/master/xls_r.png)
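
The README text in this diff says speech input must be sampled at 16 kHz. As a rough illustration of what that requirement involves, here is a minimal sketch that brings a mono signal to 16 kHz by linear interpolation. The helper name `resample_linear` and the plain-list input format are assumptions made for this example, not part of the model's tooling. Linear interpolation applies no anti-aliasing filter, so a real pipeline would more likely use a dedicated resampler from an audio library.

```python
def resample_linear(samples, src_rate, dst_rate=16_000):
    """Naively resample a mono signal (sequence of floats) to dst_rate.

    Hypothetical helper for illustration only: plain linear interpolation,
    with no low-pass filtering before downsampling.
    """
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sample rates must be positive")
    # Nothing to do if the rate already matches or the signal is too short.
    if src_rate == dst_rate or len(samples) < 2:
        return list(samples)

    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # source samples advanced per output sample
    last = len(samples) - 1

    out = []
    for i in range(n_out):
        pos = min(i * step, last)   # fractional position in the source
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        # Blend the two neighbouring source samples.
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


# One second of silence at 48 kHz becomes one second at 16 kHz.
one_second = resample_linear([0.0] * 48_000, src_rate=48_000)
print(len(one_second))
```

Audio that is already at 16 kHz is returned unchanged (as a copy), so the helper can be called unconditionally before feature extraction.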