Skip to content

Commit 2a2735f

Browse files
Merge pull request NVIDIA#935 from alancucki/readme-update
[FastPitch/PyT] Update model description
2 parents 9794268 + 0c04f3c commit 2a2735f

File tree

1 file changed

+3
-5
lines changed

1 file changed

+3
-5
lines changed

PyTorch/SpeechSynthesis/FastPitch/README.md

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -48,17 +48,15 @@ This repository provides a script and recipe to train the FastPitch model to ach
4848

4949
## Model overview
5050

51-
[FastPitch](https://arxiv.org/abs/2006.06873) is one of two major components in a neural, text-to-speech (TTS) system:
51+
[FastPitch](https://arxiv.org/abs/2006.06873) is a fully-parallel transformer architecture with prosody control over pitch and individual phoneme duration.
52+
It is one of two major components in a neural, text-to-speech (TTS) system:
5253

5354
* a mel-spectrogram generator such as [FastPitch](https://arxiv.org/abs/2006.06873) or [Tacotron 2](https://arxiv.org/abs/1712.05884), and
5455
* a waveform synthesizer such as [WaveGlow](https://arxiv.org/abs/1811.00002) (see [NVIDIA example code](https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2)).
5556

5657
Such two-component TTS system is able to synthesize natural sounding speech from raw transcripts.
5758

58-
The FastPitch model generates mel-spectrograms and predicts a pitch contour from raw input text. It allows to exert additional control over the synthesized utterances, such as:
59-
* modify the pitch contour to control the prosody,
60-
* increase or decrease the fundamental frequency in a naturally sounding way, that preserves the perceived identity of the speaker,
61-
* alter the pace of speech.
59+
The FastPitch model generates mel-spectrograms and predicts a pitch contour from raw input text.
6260
Some of the capabilities of FastPitch are presented on the website with [samples](https://fastpitch.github.io/).
6361

6462
Speech synthesized with FastPitch has state-of-the-art quality, and does not suffer from missing/repeating phrases like Tacotron2 does.

0 commit comments

Comments
 (0)