Overcoming classifier imbalance for long-tail object detection with balanced group softmax Y Li, T Wang, B Kang, S Tang, C Wang, J Li, J Feng Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 278 | 2020 |
COME for no-reference video quality assessment C Wang, L Su, W Zhang 2018 IEEE Conference on Multimedia Information Processing and Retrieval …, 2018 | 24 | 2018 |
Mega-tts: Zero-shot text-to-speech at scale with intrinsic inductive bias Z Jiang, Y Ren, Z Ye, J Liu, C Zhang, Q Yang, S Ji, R Huang, C Wang, ... arXiv preprint arXiv:2306.03509, 2023 | 22 | 2023 |
Webpage saliency prediction with multi-features fusion J Li, L Su, B Wu, J Pang, C Wang, Z Wu, Q Huang 2016 IEEE International Conference on Image Processing (ICIP), 674-678, 2016 | 17 | 2016 |
Fine-Grained Prosody Modeling in Neural Speech Synthesis Using ToBI Representation. Y Zou, S Liu, X Yin, H Lin, C Wang, H Zhang, Z Ma Interspeech, 3146-3150, 2021 | 14 | 2021 |
CNN-MR for no reference video quality assessment C Wang, L Su, Q Huang 2017 4th International Conference on Information Science and Control …, 2017 | 13 | 2017 |
Mega-tts 2: Zero-shot text-to-speech with arbitrary length speech prompts Z Jiang, J Liu, Y Ren, J He, C Zhang, Z Ye, P Wei, C Wang, X Yin, Z Ma, ... arXiv preprint arXiv:2307.07218, 2023 | 9 | 2023 |
Styles2st: Zero-shot style transfer for direct speech-to-speech translation K Song, Y Ren, Y Lei, C Wang, K Wei, L Xie, X Yin, Z Ma arXiv preprint arXiv:2305.17732, 2023 | 3 | 2023 |
Saliency detection with two-level fully convolutional networks Y Yi, L Su, Q Huang, Z Wu, C Wang 2017 IEEE International Conference on Multimedia and Expo (ICME), 271-276, 2017 | 3 | 2017 |
基于卷积神经网络的时空融合的无参考视频质量评价方法 王春峰, 苏荔, 黄庆明 中国科学院大学学报 35 (4), 544, 2018 | 1 | 2018 |
基于 3D 卷积神经网络的无参考视频质量评价 王春峰, 苏荔, 张维刚, 黄庆明 软件学报 27 (S2), 103-112, 2017 | 1 | 2017 |
GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech Y Cong, H Zhang, H Lin, S Liu, C Wang, Y Ren, X Yin, Z Ma arXiv preprint arXiv:2306.15304, 2023 | | 2023 |
LiteG2P: A fast, light and high accuracy model for grapheme-to-phoneme conversion C Wang, P Huang, Y Zou, H Zhang, S Liu, X Yin, Z Ma ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |