Technical Program
SP-L7: Speech Synthesis Using Neural Networks II
Session Type: Lecture
Time: Thursday, March 24, 13:30 - 15:30
Location: Yangtse River Hall (5F)
Session Chairs: Heiga Zen, Google UK Ltd. and Raul Fernandez, IBM Watson
SP-L7.1: HIGH-PITCHED EXCITATION GENERATION FOR GLOTTAL VOCODING IN STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING A DEEP NEURAL NETWORK
Lauri Juvela; Aalto University |
Bajibabu Bollepalli; Aalto University |
Manu Airaksinen; Aalto University |
Paavo Alku; Aalto University |
SP-L7.2: MODELING SPECTRAL ENVELOPES USING DEEP CONDITIONAL RESTRICTED BOLTZMANN MACHINES FOR STATISTICAL PARAMETRIC SPEECH SYNTHESIS
Xiang Yin; University of Science and Technology of China |
Zhen-Hua Ling; University of Science and Technology of China |
Ya-Jun Hu; University of Science and Technology of China |
Li-Rong Dai; University of Science and Technology of China |
SP-L7.3: ROBUST TTS DURATION MODELLING USING DNNS
Gustav Eje Henter; The University of Edinburgh |
Srikanth Ronanki; The University of Edinburgh |
Oliver Watts; The University of Edinburgh |
Mirjam Wester; The University of Edinburgh |
Zhizheng Wu; The University of Edinburgh |
Simon King; The University of Edinburgh |
SP-L7.4: UNSUPERVISED SPEAKER ADAPTATION FOR DNN-BASED TTS SYNTHESIS
Yuchen Fan; Microsoft Corporation |
Yao Qian; Educational Testing Service Research |
Frank K. Soong; Microsoft Corporation |
Lei He; Microsoft Corporation |
SP-L7.5: INVESTIGATING GATED RECURRENT NETWORKS FOR SPEECH SYNTHESIS
Zhizheng Wu; University of Edinburgh |
Simon King; University of Edinburgh |
SP-L7.6: DEEP NEURAL NETWORK-GUIDED UNIT SELECTION SYNTHESIS
Thomas Merritt; University of Edinburgh |
Robert A. J. Clark; University of Edinburgh |
Zhizheng Wu; University of Edinburgh |
Junichi Yamagishi; University of Edinburgh |
Simon King; University of Edinburgh |