Yi Zhao

Cited by

	All	Since 2019
Citations	471	453
h-index	9	9
i10-index	9	9

Citations per year

Year	Citations
2017	7
2018	10
2019	22
2020	57
2021	99
2022	104
2023	134
2024	34

Public access

1 article available
0 articles not available

Based on funding mandates

Co-authors

Junichi Yamagishi, National Institute of Informatics, Tokyo, Japan (verified email at nii.ac.jp)
Rohan Kumar Das, Fortemedia Singapore (verified email at ieee.org)
Xiaohai Tian, National University of Singapore (NUS) (verified email at nus.edu.sg)
Zhen-Hua Ling（凌震华）, Professor, University of Science and Technology of China (verified email at ustc.edu.cn)
Tomi Kinnunen, Professor, University of Eastern Finland (verified email at uef.fi)
Wen-Chin Huang, Nagoya University (verified email at g.sp.m.is.nagoya-u.ac.jp)
Tomoki Toda, Nagoya University (verified email at icts.nagoya-u.ac.jp)
Nobuaki Minematsu, The University of Tokyo (verified email at gavo.t.u-tokyo.ac.jp)
Jennifer Williams, Assistant Professor at University of Southampton (UK) (verified email at soton.ac.uk)
Hieu-Thi Luong, Nanyang Technological University (verified email at ntu.edu.sg)
Lauri Juvela, Assistant Professor, Machine Learning in Speech and Language Technology, Aalto University (verified email at aalto.fi)
Cheng-I Jeff Lai, Massachusetts Institute of Technology (verified email at mit.edu)
Xin Wang, National Institute of Informatics (verified email at nii.ac.jp)
Haoyu Li, National Institute of Informatics (verified email at nii.ac.jp)
Atsushi Ando, NTT Corporation (verified email at hco.ntt.co.jp)
Erica Cooper, National Institute of Information and Communications Technology

Yi Zhao

National Institute of Informatics, Japan

Verified email at nii.ac.jp

Speech, Audio, Artificial Intelligence


Title	Authors	Venue	Cited by	Year
Voice conversion challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion	Y Zhao, WC Huang, X Tian, J Yamagishi, RK Das, T Kinnunen, Z Ling, ...	Proc. Joint workshop for the Blizzard Challenge and Voice Conversion …, 2020	196	2020
Wasserstein GAN and waveform loss-based acoustic model training for multi-speaker text-to-speech synthesis systems using a WaveNet vocoder	Y Zhao, S Takaki, HT Luong, J Yamagishi, D Saito, N Minematsu	IEEE access 6, 60478-60488, 2018	69	2018
Predictions of subjective ratings and spoofing assessments of voice conversion challenge 2020 submissions	RK Das, T Kinnunen, WC Huang, Z Ling, J Yamagishi, Y Zhao, X Tian, ...	Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020	55	2020
Melons: generating melody with long-term structure using transformers and structure graph	Y Zou, P Zou, Y Zhao, K Zhang, R Zhang, X Wang	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	28	2022
Speaker representations for speaker adaptation in multiple speakers blstm-rnn-based speech synthesis	Y Zhao, D Saito, N Minematsu	Interspeech 2016, 2016	24	2016
Learning disentangled phone and speaker representations in a semi-supervised VQ-VAE paradigm	J Williams, Y Zhao, E Cooper, J Yamagishi	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	20	2021
Transferring neural speech waveform synthesizers to musical instrument sounds generation	Y Zhao, X Wang, L Juvela, J Yamagishi	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	19	2020
Improved prosody from learned F0 codebook representations for VQ-VAE speech waveform reconstruction	Y Zhao, H Li, CI Lai, J Williams, E Cooper, J Yamagishi	Interspeech 2020, 2019	19	2019
Fusion of self-supervised learned models for MOS prediction	Z Yang, W Zhou, C Chu, S Li, R Dabre, R Rubino, Y Zhao	arXiv preprint arXiv:2204.04855, 2022	13	2022
Pretraining strategies, waveform model choice, and acoustic configurations for multi-speaker end-to-end speech synthesis	E Cooper, X Wang, Y Zhao, Y Yasuda, J Yamagishi	arXiv preprint arXiv:2011.04839, 2020	6	2020
Does the lombard effect improve emotional communication in noise?-analysis of emotional speech acted in noise	Y Zhao, A Ando, S Takaki, J Yamagishi, S Kobashikawa	arXiv preprint arXiv:1903.12316, 2019	6	2019
System description for voice privacy challenge 2022	X Chen, G Li, H Huang, W Zhou, S Li, Y Cao, Y Zhao	Proc. 2nd Symposium on Security and Privacy in Speech Communication, 2022	5	2022
A study on BLSTM-RNN-based Chinese prosodic structure prediction in a unified framework with character-level features	Y Zhao, C Ding, N Minematsu, D Saito	Proceedings of the Speech Prosody Conference, 2016	4	2016
Improved Algorithm for Pitch Detection and Harmonic Separation	Y Zhao, S Zhang, XK Lin	Applied Mechanics and Materials 333, 753-763, 2013	2	2013
Two methods of design and implementation of ACELP vocoder	Y Zhao, S Zhang, X Lin	2013 IEEE International Conference on Signal Processing, Communication and …, 2013	2	2013
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction	W Zhou, Z Yang, C Chu, S Li, R Dabre, Y Zhao, K Tatsuya	arXiv preprint arXiv:2401.13249, 2024	1	2024
The UTokyo system for Blizzard Challenge 2016	Y Zhao, X You, D Saito, N Minematsu	Proc. Blizzard Challenge Workshop, 2016	1	2016
On the design of digital base-band processing unit for dPMR system	Z Yang, Y Zhao, X Lin	2013 15th IEEE International Conference on Communication Technology, 645-649, 2013	1	2013
Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation	Y Bai, C Li, H Li, Y Zhao, X Wang	arXiv preprint arXiv:2404.11275, 2024		2024
GFMAE: Self-Supervised GNN-Free Masked Autoencoders	Y Hu, S Ouyang, Z Yang, Y Zhao, J Wan, F Zhang, Z Wang, Y Liu	ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
