Följ
Jinyu Li
Jinyu Li
Partner Applied Science Manager, Microsoft
Verifierad e-postadress på microsoft.com - Startsida
Titel
Citeras av
Citeras av
År
Wavlm: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
16622022
Recent advances in deep learning for speech research at Microsoft
L Deng, J Li, JT Huang, K Yao, D Yu, F Seide, M Seltzer, G Zweig, X He, ...
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International …, 2013
10652013
Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers
JT Huang, J Li, D Yu, L Deng, Y Gong
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
7912013
An overview of noise-robust automatic speech recognition
J Li, L Deng, Y Gong, R Haeb-Umbach
IEEE/ACM Transactions on Audio, Speech, and Language Processing 22 (4), 745-777, 2014
6892014
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2301.02111, 2023
5692023
Restructuring of deep neural network acoustic models with singular value decomposition.
J Xue, J Li, Y Gong
INTERSPEECH, 2365-2369, 2013
5272013
Recent advances in end-to-end automatic speech recognition
J Li
APSIPA Transactions on Signal and Information Processing 11 (1), 2022
4032022
Learning small-size DNN with output-distribution-based criteria.
J Li, R Zhao, JT Huang, Y Gong
INTERSPEECH, 1910-1914, 2014
3602014
Feature learning in deep neural networks-studies on speech recognition tasks
D Yu, ML Seltzer, J Li, JT Huang, F Seide
arXiv preprint arXiv:1301.3605, 2013
3242013
Restructuring deep neural network acoustic models
J Xue, E Stoimenov, J Li, Y Gong
US Patent 9,728,184, 2017
2592017
Restructuring deep neural network acoustic models
J Xue, E Stoimenov, J Li, Y Gong
US Patent 9,728,184, 2017
2592017
Shared hidden layer combination for speech recognition systems
J Li, J Xue, Y Gong
US Patent 9,520,127, 2016
2412016
Shared hidden layer combination for speech recognition systems
J Li, J Xue, Y Gong
US Patent 9,520,127, 2016
2412016
Speecht5: Unified-modal encoder-decoder pre-training for spoken language processing
J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu, S Liu, T Ko, Q Li, Y Zhang, ...
arXiv preprint arXiv:2110.07205, 2021
2372021
Variable-component deep neural network for robust speech recognition
J Li, R Zhao, Y Gong
US Patent 10,019,990, 2018
2272018
Continuous speech separation: dataset and analysis
Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2252020
Developing real-time streaming transformer transducer for speech recognition on large-scale dataset
X Chen, Y Wu, Z Wang, S Liu, J Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2102021
Robust Automatic Speech Recognition: A Bridge to Practical Applications
J Li, L Deng, R Haeb-Umbach, Y Gong
Academic Press, 2015
2092015
End-to-End attention based text-dependent speaker verification
SX Zhang, Z Chen, Y Zhao, J Li, Y Gong
Spoken Language Technology Workshop (SLT), 2016 IEEE, 171-178, 2016
2082016
Improving RNN Transducer Modeling for End-to-End Speech Recognition
J Li, R Zhao, H Hu, Y Gong
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2019
2052019
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20