Följ
Haohe Liu
Haohe Liu
University of Surrey, Centre for Vision, Speech, and Signal processing (CVSSP)
Verifierad e-postadress på surrey.ac.uk - Startsida
Titel
Citeras av
Citeras av
År
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
H Liu*, Z Chen*, Y Yuan, X Mei, X Liu, D Mandic, W Wang, MD Plumbley
ICML, 2023
2092023
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
X Tan*, J Chen*, H Liu*, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
1032022
Decoupling magnitude and phase estimation with deep resunet for music source separation
Q Kong, Y Cao, H Liu, K Choi, Y Wang
International Society for Music Information Retrieval Conference, 2021
782021
Wavcaps: A chatgpt-assisted weakly-labelled audio captioning dataset for audio-language multimodal research
X Mei, C Meng, H Liu, Q Kong, T Ko, C Zhao, MD Plumbley, Y Zou, ...
arXiv preprint arXiv:2303.17395, 2023
77*2023
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
H Liu, X Liu, Q Kong, Q Tian, Y Zhao, DL Wang, C Huang, Y Wang
INTERSPEECH, 2022
54*2022
Binauralgrad: A two-stage conditional diffusion probabilistic model for binaural audio synthesis
Y Leng, Z Chen, J Guo, H Liu, J Chen, X Tan, D Mandic, L He, X Li, T Qin, ...
Advances in Neural Information Processing Systems 35, 23689-23700, 2022
392022
AudioLDM 2: Learning holistic audio generation with self-supervised pretraining
H Liu, Q Tian, Y Yuan, X Liu, X Mei, Q Kong, Y Wang, W Wang, Y Wang, ...
arXiv preprint arXiv:2308.05734, 2023
36*2023
Channel-wise subband input for better voice and accompaniment separation on high resolution music
H Liu, L Xie, J Wu, G Yang
INTERSPEECH, 2020
282020
Leveraging pre-trained bert for audio captioning
X Liu, X Mei, Q Huang, J Sun, J Zhao, H Liu, MD Plumbley, V Kilic, ...
2022 30th European Signal Processing Conference (EUSIPCO), 1145-1149, 2022
272022
Separate what you describe: language-queried audio source separation
X Liu, H Liu, Q Kong, X Mei, J Zhao, Q Huang, MD Plumbley, W Wang
INTERSPEECH, 2022
262022
Neural vocoder is all you need for speech super-resolution
H Liu, W Choi, X Liu, Q Kong, Q Tian, DL Wang
INTERSPEECH, 2022
242022
CWS-PResUNet: Music source separation with channel-wise subband phase-aware resunet
H Liu, Q Kong, J Liu
ISMIR Music Demixing (MDX) Workshop, 2021
202021
Language-based audio retrieval with pre-trained models
X Mei, X Liu, H Liu, J Sun, MD Plumbley, W Wang
DCASE 2022 Challenge, Tech. Rep., 2022
192022
Learning to detect an animal sound from five examples
I Nolasco, S Singh, V Morfi, V Lostanlen, A Strandburg-Peshkin, ...
Ecological informatics 77, 102258, 2023
182023
Speech enhancement with weakly labelled data from AudioSet
Q Kong, H Liu, X Du, L Chen, R Xia, Y Wang
INTERSPEECH, 2021
162021
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
X Liu, Q Huang, X Mei, H Liu, Q Kong, J Sun, S Li, T Ko, Y Zhang, ...
INTERSPEECH, 2022
122022
Joint echo cancellation and noise suppression based on cascaded magnitude and complex mask estimation
X Shu, Y Zhu, Y Chen, L Chen, H Liu, C Huang, Y Wang
arXiv preprint arXiv:2107.09298, 2021
122021
Surrey system for dcase 2022 task 5: Few-shot bioacoustic event detection with segment-level metric learning
H Liu, X Liu, X Mei, Q Kong, W Wang, MD Plumbley
arXiv preprint arXiv:2207.10547, 2022
112022
Simple pooling front-ends for efficient audio classification
X Liu, H Liu, Q Kong, X Mei, MD Plumbley, W Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
102023
Resgrad: Residual denoising diffusion probabilistic models for text to speech
Z Chen, Y Wu, Y Leng, J Chen, H Liu, X Tan, Y Cui, K Wang, L He, S Zhao, ...
arXiv preprint arXiv:2212.14518, 2022
102022
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20