FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao IJCAI 2022, 2022 | 163 | 2022 |
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis MWY Lam, J Wang, D Su, D Yu ICLR 2022, 2022 | 141* | 2022 |
Deep extractor network for target speaker recovery from single channel speech mixtures J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu Interspeech 2018 arXiv preprint arXiv:1807.08974, 2018 | 104 | 2018 |
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition. C Weng, J Cui, G Wang, J Wang, C Yu, D Su, D Yu Interspeech, 761-765, 2018 | 63 | 2018 |
Volume leveler controller and controlling method J Wang, L Lu, AJ Seefeldt US Patent 10,411,669, 2019 | 57 | 2019 |
Sandglasset: A light multi-granularity self-attentive network for time-domain speech separation MWY Lam, J Wang, D Su, D Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 54 | 2021 |
Apparatuses and methods for audio classifying and processing L Lu, AJ Seefeldt, J Wang US Patent 9,842,605, 2017 | 44 | 2017 |
Effective low-cost time-domain audio separation using globally attentive locally recurrent networks MWY Lam, J Wang, D Su, D Yu 2021 IEEE Spoken Language Technology Workshop (SLT), 801-808, 2021 | 34 | 2021 |
Boosting for multi-modal music emotion classification Q Lu, X Chen, D Yang, J Wang 11th International Society for Music Information and Retrieval Conference …, 2010 | 34 | 2010 |
Real-time speech/music classification with a hierarchical oblique decision tree J Wang, Q Wu, H Deng, Q Yan 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 34 | 2008 |
Audio processing method and audio processing apparatus, and training method J Wang, L Lu US Patent 9,830,896, 2017 | 31 | 2017 |
Mixup-breakdown: a consistency training method for improving generalization of speech separation models MWY Lam, J Wang, D Su, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 22 | 2020 |
Equalizer controller and controlling method L Lu, J Wang, A Seefeldt, M Hu US Patent 9,621,124, 2017 | 22 | 2017 |
Method for generating a surround sound field, apparatus and computer program product thereof X Sun, B Cheng, S Xu, Z Shuang, J Wang US Patent 9,668,080, 2017 | 20 | 2017 |
Adaptive audio content generation J Wang, L Lu, M Hu, DJ Breebaart, NR Tsingos US Patent 9,756,445, 2017 | 17 | 2017 |
Improving attention-based end-to-end ASR systems with sequence-based loss functions J Cui, C Weng, G Wang, J Wang, P Wang, C Yu, D Su, D Yu 2018 IEEE Spoken Language Technology Workshop (SLT), 353-360, 2018 | 16 | 2018 |
Adaptive panner of audio objects J Wang, G Cengarle, JF Torres, D Arteaga US Patent 9,949,052, 2018 | 16 | 2018 |
Predicting high-level music semantics using social tags via on-tology-based reasoning J Wang, X Chen, Y Hu, T Feng INTERNATIONAL SOCIETY FOR MUSIC INFORMATION RETRIEVAL CONFERENCE 11, 9-13, 2010 | 16 | 2010 |
Separating audio sources J Wang US Patent 10,176,826, 2019 | 14 | 2019 |
Audio object extraction M Hu, L Lu, J Wang US Patent 9,786,288, 2017 | 13 | 2017 |