Följ
Yi Zhao
Titel
Citeras av
Citeras av
År
Voice conversion challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion
Y Zhao, WC Huang, X Tian, J Yamagishi, RK Das, T Kinnunen, Z Ling, ...
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion …, 2020
1962020
Wasserstein GAN and waveform loss-based acoustic model training for multi-speaker text-to-speech synthesis systems using a WaveNet vocoder
Y Zhao, S Takaki, HT Luong, J Yamagishi, D Saito, N Minematsu
IEEE access 6, 60478-60488, 2018
692018
Predictions of subjective ratings and spoofing assessments of voice conversion challenge 2020 submissions
RK Das, T Kinnunen, WC Huang, Z Ling, J Yamagishi, Y Zhao, X Tian, ...
Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020
552020
Melons: generating melody with long-term structure using transformers and structure graph
Y Zou, P Zou, Y Zhao, K Zhang, R Zhang, X Wang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
282022
Speaker representations for speaker adaptation in multiple speakers blstm-rnn-based speech synthesis
Y Zhao, D Saito, N Minematsu
Interspeech 2016, 2016
242016
Learning disentangled phone and speaker representations in a semi-supervised VQ-VAE paradigm
J Williams, Y Zhao, E Cooper, J Yamagishi
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
202021
Transferring neural speech waveform synthesizers to musical instrument sounds generation
Y Zhao, X Wang, L Juvela, J Yamagishi
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
192020
Improved prosody from learned F0 codebook representations for VQ-VAE speech waveform reconstruction
Y Zhao, H Li, CI Lai, J Williams, E Cooper, J Yamagishi
Interspeech 2020, 2019
192019
Fusion of self-supervised learned models for MOS prediction
Z Yang, W Zhou, C Chu, S Li, R Dabre, R Rubino, Y Zhao
arXiv preprint arXiv:2204.04855, 2022
132022
Pretraining strategies, waveform model choice, and acoustic configurations for multi-speaker end-to-end speech synthesis
E Cooper, X Wang, Y Zhao, Y Yasuda, J Yamagishi
arXiv preprint arXiv:2011.04839, 2020
62020
Does the lombard effect improve emotional communication in noise?-analysis of emotional speech acted in noise
Y Zhao, A Ando, S Takaki, J Yamagishi, S Kobashikawa
arXiv preprint arXiv:1903.12316, 2019
62019
System description for voice privacy challenge 2022
X Chen, G Li, H Huang, W Zhou, S Li, Y Cao, Y Zhao
Proc. 2nd Symposium on Security and Privacy in Speech Communication, 2022
52022
A study on BLSTM-RNN-based Chinese prosodic structure prediction in a unified framework with character-level features
Y Zhao, C Ding, N Minematsu, D Saito
Proceedings of the Speech Prosody Conference, 2016
42016
Improved Algorithm for Pitch Detection and Harmonic Separation
Y Zhao, S Zhang, XK Lin
Applied Mechanics and Materials 333, 753-763, 2013
22013
Two methods of design and implementation of ACELP vocoder
Y Zhao, S Zhang, X Lin
2013 IEEE International Conference on Signal Processing, Communication and …, 2013
22013
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction
W Zhou, Z Yang, C Chu, S Li, R Dabre, Y Zhao, K Tatsuya
arXiv preprint arXiv:2401.13249, 2024
12024
The UTokyo system for Blizzard Challenge 2016
Y Zhao, X You, D Saito, N Minematsu
Proc. Blizzard Challenge Workshop, 2016
12016
On the design of digital base-band processing unit for dPMR system
Z Yang, Y Zhao, X Lin
2013 15th IEEE International Conference on Communication Technology, 645-649, 2013
12013
Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation
Y Bai, C Li, H Li, Y Zhao, X Wang
arXiv preprint arXiv:2404.11275, 2024
2024
GFMAE: Self-Supervised GNN-Free Masked Autoencoders
Y Hu, S Ouyang, Z Yang, Y Zhao, J Wan, F Zhang, Z Wang, Y Liu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20