Singing Voice Synthesis Based on Deep Neural Networks. M Nishimura, K Hashimoto, K Oura, Y Nankaku, K Tokuda Interspeech, 2478-2482, 2016 | 82 | 2016 |
The effect of neural networks in statistical parametric speech synthesis K Hashimoto, K Oura, Y Nankaku, K Tokuda 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 60 | 2015 |
Singing voice synthesis based on generative adversarial networks Y Hono, K Hashimoto, K Oura, Y Nankaku, K Tokuda ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 44 | 2019 |
Trajectory training considering global variance for speech synthesis based on neural networks K Hashimoto, K Oura, Y Nankaku, K Tokuda 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 36 | 2016 |
Privacy-preserving sound to degrade automatic speaker verification performance K Hashimoto, J Yamagishi, I Echizen 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 31 | 2016 |
Singing voice synthesis based on convolutional neural networks K Nakamura, K Hashimoto, K Oura, Y Nankaku, K Tokuda arXiv preprint arXiv:1904.06868, 2019 | 29 | 2019 |
Recent development of the DNN-based singing voice synthesis system—sinsy Y Hono, S Murata, K Nakamura, K Hashimoto, K Oura, Y Nankaku, ... 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 25 | 2018 |
Statistical voice conversion based on WaveNet J Niwa, T Yoshimura, K Hashimoto, K Oura, Y Nankaku, K Tokuda 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 23 | 2018 |
Temporal modeling in neural network based statistical parametric speech synthesis. K Tokuda, K Hashimoto, K Oura, Y Nankaku SSW, 106-111, 2016 | 20 | 2016 |
Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition K Hashimoto, H Zen, Y Nankaku, A Lee, K Tokuda Ninth Annual Conference of the International Speech Communication Association, 2008 | 20 | 2008 |
Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis K Shirota, K Nakamura, K Hashimoto, K Oura, Y Nankaku, K Tokuda 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 19 | 2014 |
A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech K Nakamura, K Hashimoto, K Oura, Y Nankaku, K Tokuda Fifteenth Annual Conference of the International Speech Communication …, 2014 | 19 | 2014 |
Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis T Fujimoto, K Hashimoto, K Oura, Y Nankaku, K Tokuda 10th ISCA Speech Synthesis Workshop. ISCA, Vienna, Austria, 2019 | 17 | 2019 |
Hierarchical multi-grained generative model for expressive speech synthesis Y Hono, K Tsuboi, K Sawada, K Hashimoto, K Oura, Y Nankaku, ... arXiv preprint arXiv:2009.08474, 2020 | 16 | 2020 |
Impacts of machine translation and speech synthesis on speech-to-speech translation K Hashimoto, J Yamagishi, W Byrne, S King, K Tokuda Speech Communication 54 (7), 857-866, 2012 | 16 | 2012 |
Mel-cepstrum-based quantization noise shaping applied to neural-network-based speech waveform synthesis T Yoshimura, K Hashimoto, K Oura, Y Nankaku, K Tokuda IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (7), 1177 …, 2018 | 15 | 2018 |
Redefining the Linguistic Context Feature Set for HMM and DNN TTS Through Position and Parsing. R Dall, K Hashimoto, K Oura, Y Nankaku, K Tokuda INTERSPEECH, 2851-2855, 2016 | 15 | 2016 |
A Bayesian approach to HMM-based speech synthesis K Hashimoto, H Zen, Y Nankaku, T Masuko, K Tokuda 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 14 | 2009 |
Fast and high-quality singing voice synthesis system based on convolutional neural networks K Nakamura, S Takaki, K Hashimoto, K Oura, Y Nankaku, K Tokuda ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 13 | 2020 |
A Bayesian approach to hidden semi-Markov model based speech synthesis K Hashimoto, Y Nankaku, K Tokuda Tenth Annual Conference of the International Speech Communication Association, 2009 | 13 | 2009 |