Yanzhang He
Titel
Citeras av
Citeras av
År
Streaming end-to-end speech recognition for mobile devices
Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
2032019
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
712019
Deep Neural Network Based Spectral Feature Mapping for Robust Speech Recognition
K Han, Y He, D Bagchi, E Fosler-Lussier, DL Wang
INTERSPEECH 2015, 2015
572015
Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition
D Bagchi, MI Mandel, Z Wang, Y He, A Plummer, E Fosler-Lussier
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
492015
Streaming small-footprint keyword spotting using sequence-to-sequence models
Y He, R Prabhavalkar, K Rao, W Li, A Bakhtin, I McGraw
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
472017
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency
TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
442020
Two-pass end-to-end speech recognition
TN Sainath, R Pang, D Rybach, Y He, R Prabhavalkar, W Li, M Visontai, ...
arXiv preprint arXiv:1908.10992, 2019
392019
Conditional random fields in speech, audio, and language processing
E Fosler-Lussier, Y He, P Jyothi, R Prabhavalkar
Proceedings of the IEEE 101 (5), 1054-1075, 2013
362013
Subword-based modeling for handling OOV words in keyword spotting
Y He, B Hutchinson, P Baumann, M Ostendorf, E Fosler-Lussier, ...
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International …, 2014
302014
Towards fast and accurate streaming end-to-end asr
B Li, S Chang, TN Sainath, R Pang, Y He, T Strohman, Y Wu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
292020
Efficient Segmental Conditional Random Fields for Phone Recognition
Y He, E Fosler-Lussier
13th Annual Conference of the International Speech Communication Association …, 2012
22*2012
Using pronunciation-based morphological subword units to improve OOV handling in keyword search
Y He, P Baumann, H Fang, B Hutchinson, A Jaech, M Ostendorf, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (1), 79-92, 2015
112015
Segmental Conditional Random Fields with Deep Neural Networks as Acoustic Models for First-Pass Word Recognition
Y He, E Fosler-Lussier
INTERSPEECH 2015, 2015
112015
Joint endpointing and decoding with end-to-end models
SY Chang, R Prabhavalkar, Y He, TN Sainath, G Simko
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
102019
IMPROVEMENTS ON TRANSDUCING SYLLABLE LATTICE TO WORD LATTICE FOR KEYWORD SEARCH
H Su, VT Pham, Y He, J Hieronymus
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International …, 2015
102015
Speech Information Processing: Theory and Applications [Scanning the Issue]
D O'shaughnessy, L Deng, H Li
Proceedings of the IEEE 101 (5), 1034-1037, 2013
92013
SYLLABLE BASED KEYWORD SEARCH: TRANSDUCING SYLLABLE LATTICES TO WORD LATTICES
H Su, J Hieronymus, Y He, E Fosler-Lussier, S Wegmann
2014 IEEE Spoken Language Technology Workshop (SLT 2014), 2014
82014
Low latency speech recognition using end-to-end prefetching
SY Chang, B Li, D Rybach, Y He, W Li, T Sainath, T Strohman
Proc. of Interspeech, 2020
42020
An attention-based joint acoustic and text on-device end-to-end model
TN Sainath, R Pang, RJ Weiss, Y He, C Chiu, T Strohman
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
32020
Segmental Models with an Exploration of Acoustic and Lexical Grouping in Automatic Speech Recognition
Y He
The Ohio State University, 2015
32015
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20