State-of-the-art speech recognition with sequence-to-sequence models CC Chiu, TN Sainath, Y Wu, R Prabhavalkar, P Nguyen, Z Chen, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 671 | 2018 |
Specaugment: A simple data augmentation method for automatic speech recognition DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le arXiv preprint arXiv:1904.08779, 2019 | 647 | 2019 |
Monotonic chunkwise attention CC Chiu, C Raffel arXiv preprint arXiv:1712.05382, 2017 | 110 | 2017 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 73 | 2019 |
Minimum word error rate training for attention-based sequence-to-sequence models R Prabhavalkar, TN Sainath, Y Wu, P Nguyen, Z Chen, CC Chiu, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 73 | 2018 |
A comparison of techniques for language model integration in encoder-decoder speech recognition S Toshniwal, A Kannan, CC Chiu, Y Wu, TN Sainath, K Livescu 2018 IEEE spoken language technology workshop (SLT), 369-375, 2018 | 65 | 2018 |
Leveraging weakly supervised data to improve end-to-end speech-to-text translation Y Jia, M Johnson, W Macherey, RJ Weiss, Y Cao, CC Chiu, N Ari, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 63 | 2019 |
How to train your avatar: A data driven approach to gesture generation CC Chiu, S Marsella International Workshop on Intelligent Virtual Agents, 127-140, 2011 | 62 | 2011 |
Conformer: Convolution-augmented transformer for speech recognition A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ... arXiv preprint arXiv:2005.08100, 2020 | 51 | 2020 |
Monotonic infinite lookback attention for simultaneous machine translation N Arivazhagan, C Cherry, W Macherey, CC Chiu, S Yavuz, R Pang, W Li, ... arXiv preprint arXiv:1906.05218, 2019 | 51 | 2019 |
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 49 | 2020 |
Predicting co-verbal gestures: a deep and temporal modeling approach CC Chiu, LP Morency, S Marsella International Conference on Intelligent Virtual Agents, 152-166, 2015 | 47 | 2015 |
Two-pass end-to-end speech recognition TN Sainath, R Pang, D Rybach, Y He, R Prabhavalkar, W Li, M Visontai, ... arXiv preprint arXiv:1908.10992, 2019 | 42 | 2019 |
Learning online alignments with continuous rewards policy gradient Y Luo, CC Chiu, N Jaitly, I Sutskever 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 39 | 2017 |
Gesture generation with low-dimensional embeddings CC Chiu, S Marsella Proceedings of the 2014 international conference on Autonomous agents and …, 2014 | 39 | 2014 |
Improving the performance of online neural transducer models TN Sainath, CC Chiu, R Prabhavalkar, A Kannan, Y Wu, P Nguyen, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 37 | 2018 |
No need for a lexicon? evaluating the value of the pronunciation lexica in end-to-end models TN Sainath, R Prabhavalkar, S Kumar, S Lee, A Kannan, D Rybach, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 34 | 2018 |
Speech recognition for medical conversations CC Chiu, A Tripathi, K Chou, C Co, N Jaitly, D Jaunzeikare, A Kannan, ... arXiv preprint arXiv:1711.07274, 2017 | 30 | 2017 |
ContextNet: Improving convolutional neural networks for automatic speech recognition with global context W Han, Z Zhang, Y Zhang, J Yu, CC Chiu, J Qin, A Gulati, R Pang, Y Wu arXiv preprint arXiv:2005.03191, 2020 | 27 | 2020 |
Specaugment on large scale datasets DS Park, Y Zhang, CC Chiu, Y Chen, B Li, W Chan, QV Le, Y Wu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 27 | 2020 |