| gptkbp:instanceOf | gptkb:speech_synthesis_system 
 | 
                        
                            
                                | gptkbp:application | text-to-speech 
 | 
                        
                            
                                | gptkbp:arXivID | 1712.05884 
 | 
                        
                            
                                | gptkbp:author | gptkb:Navdeep_Jaitly gptkb:Mike_Schuster
 gptkb:RJ_Skerry-Ryan
 gptkb:Rif_A._Saurous
 gptkb:Ron_J._Weiss
 gptkb:Yannis_Agiomyrgiannakis
 gptkb:Yonghui_Wu
 gptkb:Zhifeng_Chen
 gptkb:Zongheng_Yang
 Yuxuan Wang
 Jonathan Shen
 Ruoming Pang
 Yu Zhang
 
 | 
                        
                            
                                | gptkbp:category | gptkb:artificial_intelligence gptkb:public_speaker
 deep learning
 
 | 
                        
                            
                                | gptkbp:citation | 2017 Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
 
 | 
                        
                            
                                | gptkbp:component | gptkb:convolutional_neural_network gptkb:recurrent_neural_network
 attention mechanism
 
 | 
                        
                            
                                | gptkbp:developedBy | gptkb:Google 
 | 
                        
                            
                                | gptkbp:github | https://github.com/NVIDIA/tacotron2 
 | 
                        
                            
                                | gptkbp:influenced | gptkb:Glow-TTS FastSpeech
 Parallel Tacotron
 
 | 
                        
                            
                                | gptkbp:input | gptkb:text character sequence
 
 | 
                        
                            
                                | gptkbp:language | English 
 | 
                        
                            
                                | gptkbp:notableFor | end-to-end text-to-speech synthesis high-quality natural speech
 
 | 
                        
                            
                                | gptkbp:openSource | yes 
 | 
                        
                            
                                | gptkbp:output | speech waveform 
 | 
                        
                            
                                | gptkbp:outputRepresentation | mel spectrogram 
 | 
                        
                            
                                | gptkbp:predecessor | gptkb:Tacotron 
 | 
                        
                            
                                | gptkbp:publishedIn | gptkb:arXiv 
 | 
                        
                            
                                | gptkbp:releaseYear | 2017 
 | 
                        
                            
                                | gptkbp:trainer | LJSpeech VCTK
 paired text and speech
 
 | 
                        
                            
                                | gptkbp:uses | sequence-to-sequence model WaveNet vocoder
 
 | 
                        
                            
                                | gptkbp:bfsParent | gptkb:Tacotron 
 | 
                        
                            
                                | gptkbp:bfsLayer | 7 
 | 
                        
                            
                                | https://www.w3.org/2000/01/rdf-schema#label | Tacotron 2 
 |