tensorspeech/tts-tacotron2-ljspeech-en

dathudeptrai committed on Jun 1, 2021

Commit efdbf66 · 1 Parent(s): 8f7e883

🦋 Update README

Files changed (1)

README.md +77 -0

README.md CHANGED

@@ -5,6 +5,83 @@ tags:
 - text-to-speech
 - text-to-mel
 language: en
+license: apache-2.0
+datasets:
+- ljspeech
 widget:
 - text: "Hello, how are you doing?"
 ---
+# Tacotron 2 with Guided Attention trained on LJSpeech (En)
+This repository provides a pretrained [Tacotron2](https://arxiv.org/abs/1712.05884) model trained with [Guided Attention](https://arxiv.org/abs/1710.08969) on the LJSpeech dataset (English). For details of the model, see
+[TensorFlowTTS](https://github.com/TensorSpeech/TensorFlowTTS).
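
As a rough illustration of what "guided attention" refers to, the sketch below follows the penalty formula from the linked Guided Attention paper: `W[n][t] = 1 - exp(-((n/N - t/T)^2) / (2 g^2))`, with the loss taken as the mean of `A[n][t] * W[n][t]`. It is not taken from the TensorFlowTTS code base, and the function names and the default `g = 0.2` are assumptions made for this example only.

```python
import math


def guided_attention_weights(n_text, n_mel, g=0.2):
    """Penalty matrix W (hypothetical helper): near 0 on the diagonal, near 1 far from it."""
    return [
        [
            1.0 - math.exp(-((n / n_text - t / n_mel) ** 2) / (2.0 * g * g))
            for t in range(n_mel)
        ]
        for n in range(n_text)
    ]


def guided_attention_loss(attention, g=0.2):
    """Mean of attention[n][t] * W[n][t] over all text and mel positions."""
    n_text = len(attention)
    n_mel = len(attention[0])
    weights = guided_attention_weights(n_text, n_mel, g)
    total = sum(
        attention[n][t] * weights[n][t]
        for n in range(n_text)
        for t in range(n_mel)
    )
    return total / (n_text * n_mel)


# A perfectly diagonal alignment is not penalized; a reversed one is.
diagonal = [[1.0 if n == t else 0.0 for t in range(4)] for n in range(4)]
reversed_alignment = [row[::-1] for row in diagonal]
print(guided_attention_loss(diagonal))
print(guided_attention_loss(reversed_alignment))
```

The penalty pushes the attention matrix toward a roughly monotonic, near-diagonal alignment between text positions and mel frames, which is the behavior the paper motivates for speech synthesis.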
+## Install TensorFlowTTS
+First, install TensorFlowTTS with the following command:
+```
+pip install TensorFlowTTS
+```
+### Converting your Text to Mel Spectrogram
+```python
+import tensorflow as tf
+
+from tensorflow_tts.inference import AutoProcessor
+from tensorflow_tts.inference import TFAutoModel
+
+# Load the text processor and the pretrained Tacotron 2 model.
+processor = AutoProcessor.from_pretrained("tensorspeech/tts-tacotron2-ljspeech-en")
+tacotron2 = TFAutoModel.from_pretrained("tensorspeech/tts-tacotron2-ljspeech-en")
+
+text = "This is a demo to show how to use our model to generate mel spectrogram from raw text."
+# Convert the raw text to a sequence of symbol IDs.
+input_ids = processor.text_to_sequence(text)
+
+# Run inference on a batch containing a single utterance.
+decoder_output, mel_outputs, stop_token_prediction, alignment_history = tacotron2.inference(
+    input_ids=tf.expand_dims(tf.convert_to_tensor(input_ids, dtype=tf.int32), 0),
+    input_lengths=tf.convert_to_tensor([len(input_ids)], tf.int32),
+    speaker_ids=tf.convert_to_tensor([0], dtype=tf.int32),
+)
+```
+#### Referencing Tacotron 2
+```
+@article{DBLP:journals/corr/abs-1712-05884,
+  author    = {Jonathan Shen and
+               Ruoming Pang and
+               Ron J. Weiss and
+               Mike Schuster and
+               Navdeep Jaitly and
+               Zongheng Yang and
+               Zhifeng Chen and
+               Yu Zhang and
+               Yuxuan Wang and
+               R. J. Skerry{-}Ryan and
+               Rif A. Saurous and
+               Yannis Agiomyrgiannakis and
+               Yonghui Wu},
+  title     = {Natural {TTS} Synthesis by Conditioning WaveNet on Mel Spectrogram
+               Predictions},
+  journal   = {CoRR},
+  volume    = {abs/1712.05884},
+  year      = {2017},
+  url       = {http://arxiv.org/abs/1712.05884},
+  archivePrefix = {arXiv},
+  eprint    = {1712.05884},
+  timestamp = {Thu, 28 Nov 2019 08:59:52 +0100},
+  biburl    = {https://dblp.org/rec/journals/corr/abs-1712-05884.bib},
+  bibsource = {dblp computer science bibliography, https://dblp.org}
+}
+```
+#### Referencing TensorFlowTTS
+```
+@misc{TFTTS,
+    author = {Minh Nguyen and Alejandro Miguel Velasquez and Erogol and Kuan Chen and Dawid Kobus and Takuya Ebata and
+    Trinh Le and Yunchao He},
+    title = {TensorflowTTS},
+    year = {2020},
+    publisher = {GitHub},
+    journal = {GitHub repository},
+    howpublished = {\url{https://github.com/TensorSpeech/TensorFlowTTS}},
+}
+```