End-to-end speech recognition with adaptive computation steps M Li, M Liu, H Masanori ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 39 | 2019 |
Transformer-based online speech recognition with decoder-end adaptive computation steps M Li, C Zorilă, R Doddipatla 2021 IEEE spoken language technology workshop (SLT), 1-7, 2021 | 22 | 2021 |
Head-synchronous decoding for transformer-based streaming asr M Li, C Zorilă, R Doddipatla ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 21 | 2021 |
Non-autoregressive end-to-end approaches for joint automatic speech recognition and spoken language understanding M Li, R Doddipatla 2022 IEEE Spoken Language Technology Workshop (SLT), 390-397, 2023 | 9 | 2023 |
Transformer-based streaming ASR with cumulative attention M Li, S Zhang, C Zorilă, R Doddipatla ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 7 | 2022 |
Framewise Supervised Training Towards End-to-End Speech Recognition Models: First Results. M Li, Y Cao, W Zhou, M Liu Interspeech, 1641-1645, 2019 | 6 | 2019 |
An investigation into the multi-channel time domain speaker extraction network C Zorilă, M Li, R Doddipatla 2021 IEEE Spoken Language Technology Workshop (SLT), 793-800, 2021 | 4 | 2021 |
Multiple-hypothesis RNN-T Loss for unsupervised fine-tuning and self-training of neural transducer CT Do, M Li, R Doddipatla arXiv preprint arXiv:2207.14736, 2022 | 3 | 2022 |
Toshiba’s speech recognition system for the CHiME 2020 challenge C Zorila, M Li, D Hayakawa, M Liu, N Ding, R Doddipatla Proc. of The 6th Intl. Workshop on Speech Processing in Everyday …, 2020 | 3 | 2020 |
Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding M Li, S Keizer, R Doddipatla arXiv preprint arXiv:2406.15209, 2024 | 2 | 2024 |
Towards a Unified End-to-End Language Understanding System for Speech and Text Inputs M Li, C Zorilă, CT Do, R Doddipatla 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 2 | 2023 |
Improving HS-DACS based streaming Transformer ASR with deep reinforcement learning M Li, R Doddipatla 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 2 | 2021 |
DiaLoc: An Iterative Approach to Embodied Dialog Localization C Zhang, M Li, I Budvytis, S Liwicki Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 1 | 2024 |
Cumulative Attention Based Streaming Transformer ASR with Internal Language Model Joint Training and Rescoring M Li, CT Do, R Doddipatla ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Towards a speaker diarization system for the CHiME 2020 dinner party transcription C Boeddeker, T Cord-Landwehr, J Heitkaemper, C Zorila, D Hayakawa, ... Proc. 6th International Workshop on Speech Processing in Everyday …, 2020 | 1 | 2020 |
Domain Adaptive Self-supervised Training of Automatic Speech Recognition CT Do, R Doddipatla, M Li, T Hain | 1 | |
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding M Li, CT Do, S Keizer, Y Farag, S Stoyanchev, R Doddipatla arXiv preprint arXiv:2408.16423, 2024 | | 2024 |
Speech recognition systems and methods LI Mohan, T Zorila, RS Doddipatla US Patent 12,002,450, 2024 | | 2024 |
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition M Li, R Doddipatla, C Zorila Proc. Interspeech 2022, 2088-2092, 2022 | | 2022 |