Technical Program

SP-P6: Speech Synthesis using Neural Networks I

Session Type: Poster
Time: Wednesday, March 23, 13:30 - 15:30
Location: Poster Area H
Session Chair: Frank Soong, Microsoft
 
SP-P6.1: FROM HMMS TO DNNS: WHERE DO THE IMPROVEMENTS COME FROM?
         Oliver Watts; University of Edinburgh
         Gustav Eje Henter; University of Edinburgh
         Thomas Merritt; University of Edinburgh
         Zhizheng Wu; University of Edinburgh
         Simon King; University of Edinburgh
 
SP-P6.2: DEEP BELIEF NETWORK-BASED POST-FILTERING FOR STATISTICAL PARAMETRIC SPEECH SYNTHESIS
         Ya-Jun Hu; University of Science and Technology of China
         Zhen-Hua Ling; University of Science and Technology of China
         Li-Rong Dai; University of Science and Technology of China
 
SP-P6.3: A KL DIVERGENCE AND DNN APPROACH TO CROSS-LINGUAL TTS
         Feng-Long Xie; Harbin Institute of Technology
         Frank K. Soong; Microsoft Research Asia
         Haifeng Li; Harbin Institute of Technology
 
SP-P6.4: GATING RECURRENT MIXTURE DENSITY NETWORKS FOR ACOUSTIC MODELING IN STATISTICAL PARAMETRIC SPEECH SYNTHESIS
         Wenfu Wang; Institute of Automation, Chinese Academy of Sciences
         Shuang Xu; Institute of Automation, Chinese Academy of Sciences
         Bo Xu; Institute of Automation, Chinese Academy of Sciences
 
SP-P6.5: WAVELET-BASED DECOMPOSITION OF F0 AS A SECONDARY TASK FOR DNN-BASED SPEECH SYNTHESIS WITH MULTI-TASK LEARNING
         Manuel Sam Ribeiro; The Centre for Speech Technology Research, University of Edinburgh
         Oliver Watts; The Centre for Speech Technology Research, University of Edinburgh
         Junichi Yamagishi; The Centre for Speech Technology Research, University of Edinburgh
         Robert A. J. Clark; The Centre for Speech Technology Research, University of Edinburgh
 
SP-P6.6: SPEAKER ADAPTIVE MODEL BASED ON BOLTZMANN MACHINE FOR NON-PARALLEL TRAINING IN VOICE CONVERSION
         Toru Nakashika; The University of Electro-Communications
         Yasuhiro Minami; The University of Electro-Communications
 
SP-P6.7: A DEEP AUTO-ENCODER BASED LOW-DIMENSIONAL FEATURE EXTRACTION FROM FFT SPECTRAL ENVELOPES FOR STATISTICAL PARAMETRIC SPEECH SYNTHESIS
         Shinji Takaki; National Institute of Informatics
         Junichi Yamagishi; National Institute of Informatics
 
SP-P6.8: SPEAKER AND LANGUAGE FACTORIZATION IN DNN-BASED TTS SYNTHESIS
         Yuchen Fan; Microsoft Corporation
         Yao Qian; Educational Testing Service Research
         Frank K. Soong; Microsoft Corporation
         Lei He; Microsoft Corporation
 
SP-P6.9: LEARNING CROSS-LINGUAL INFORMATION WITH MULTILINGUAL BLSTM FOR SPEECH SYNTHESIS OF LOW-RESOURCE LANGUAGES
         Quanjie Yu; Tsinghua University
         Peng Liu; Tsinghua University
         Zhiyong Wu; Tsinghua University
         Shiyin Kang; The Chinese University of Hong Kong
         Helen Meng; The Chinese University of Hong Kong
         Lianhong Cai; Tsinghua University
 
SP-P6.10: LOW LEVEL DESCRIPTORS BASED DBLSTM BOTTLENECK FEATURE FOR SPEECH DRIVEN TALKING AVATAR
         Xinyu Lan; Tsinghua University
         Xu Li; Tsinghua University
         Yishuang Ning; Tsinghua University
         Zhiyong Wu; Tsinghua University
         Helen Meng; The Chinese University of Hong Kong
         Jia Jia; Tsinghua University
         Lianhong Cai; Tsinghua University
 

ICASSP 2016 Patrons