Follow
Xiong Xiao
Xiong Xiao
Principal Applied scientist, Microsoft
Verified email at microsoft.com
Title
Cited by
Cited by
Year
Wavlm: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
9092022
A learning-based approach to direction of arrival estimation in noisy and reverberant environments
X Xiao, S Zhao, X Zhong, DL Jones, ES Chng, H Li
2015 IEEE international conference on acoustics, speech and signal …, 2015
3102015
Deep beamforming networks for multi-channel speech recognition
X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
2092016
Continuous speech separation: Dataset and analysis
Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1902020
Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge.
X Xiao, X Tian, S Du, H Xu, E Chng, H Li
Interspeech, 2052-2056, 2015
1562015
Synthetic speech detection using temporal modulation feature
Z Wu, X Xiao, ES Chng, H Li
2013 IEEE international conference on acoustics, speech and signal …, 2013
1432013
Multi-channel overlapped speech recognition with location guided speech extraction network
Z Chen, X Xiao, T Yoshioka, H Erdogan, J Li, Y Gong
2018 IEEE Spoken Language Technology Workshop (SLT), 558-565, 2018
1302018
On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition
X Xiao, S Zhao, DL Jones, ES Chng, H Li
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
1022017
Unified architecture for multichannel end-to-end speech recognition with neural beamforming
T Ochiai, S Watanabe, T Hori, JR Hershey, X Xiao
IEEE Journal of Selected Topics in Signal Processing 11 (8), 1274-1288, 2017
982017
Computerized intelligent assistant for conferences
A Diamant, KM Ben-Dor, E Krupka, R Halaly, Y Smolin, I Gurvich, ...
US Patent 10,867,610, 2020
942020
Recognizing overlapped speech in meetings: A multichannel separation approach using neural networks
T Yoshioka, H Erdogan, Z Chen, X Xiao, F Alleva
arXiv preprint arXiv:1810.03655, 2018
942018
Normalization of the speech modulation spectra for robust speech recognition
X Xiao, ES Chng, H Li
IEEE Transactions on Audio, Speech, and Language Processing 16 (8), 1662-1674, 2008
912008
Advances in online audio-visual meeting transcription
T Yoshioka, I Abramovski, C Aksoylar, Z Chen, M David, D Dimitriadis, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
832019
Single channel speech separation with constrained utterance level permutation invariant training using grid lstm
C Xu, W Rao, X Xiao, ES Chng, H Li
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
812018
Single-channel speech extraction using speaker inventory and attention network
X Xiao, Z Chen, T Yoshioka, H Erdogan, C Liu, D Dimitriadis, J Droppo, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
732019
Microsoft speaker diarization system for the voxceleb speaker recognition challenge 2020
X Xiao, N Kanda, Z Chen, T Zhou, T Yoshioka, S Chen, Y Zhao, G Liu, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
672021
Developing far-field speaker system via teacher-student learning
J Li, R Zhao, Z Chen, C Liu, X Xiao, G Ye, Y Gong
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
672018
MASS: A Malay language LVCSR corpus resource
TP Tan, X Xiao, EK Tang, ES Chng, H Li
2009 Oriental COCOSDA International Conference on Speech Database and …, 2009
632009
Multi-channel speech separation
Z Chen, J Li, X Xiao, T Yoshioka, H Wang, Z Wang, Y Gong
US Patent 10,839,822, 2020
622020
Spoofing detection from a feature representation perspective
X Tian, Z Wu, X Xiao, ES Chng, H Li
2016 IEEE International conference on acoustics, speech and signal …, 2016
592016
The system can't perform the operation now. Try again later.
Articles 1–20