Yi Yuan
University of Surrey, Centre for Vision, Speech, and Signal Processing (CVSSP)
Verified email at surrey.ac.uk
Title
Cited by
Year
Audioldm: Text-to-audio generation with latent diffusion models
H Liu, Z Chen, Y Yuan, X Mei, X Liu, D Mandic, W Wang, MD Plumbley
arXiv preprint arXiv:2301.12503, 2023
Cited by 216, 2023
AudioLDM 2: Learning holistic audio generation with self-supervised pretraining
H Liu, Q Tian, Y Yuan, X Liu, X Mei, Q Kong, Y Wang, W Wang, Y Wang, ...
arXiv preprint arXiv:2308.05734, 2023
Cited by 20, 2023
Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, and Mark D Plumbley. Audioldm 2: Learning holistic audio generation with self-supervised pretraining
H Liu, Q Tian, Y Yuan, X Liu, X Mei
arXiv preprint arXiv:2308.05734 8, 2023
Cited by 20, 2023
Separate anything you describe
X Liu, Q Kong, Y Zhao, H Liu, Y Yuan, Y Liu, R Xia, Y Wang, MD Plumbley, ...
arXiv preprint arXiv:2308.05037, 2023
Cited by 8, 2023
Latent diffusion model based foley sound generation system for dcase challenge 2023 task 7
Y Yuan, H Liu, X Liu, X Kang, MD Plumbley, W Wang
arXiv preprint arXiv:2305.15905, 2023
Cited by 8, 2023
Leveraging pre-trained audioldm for sound generation: A benchmark study
Y Yuan, H Liu, J Liang, X Liu, MD Plumbley, W Wang
2023 31st European Signal Processing Conference (EUSIPCO), 765-769, 2023
Cited by 6, 2023
Text-driven foley sound generation with latent diffusion model
Y Yuan, H Liu, X Liu, X Kang, P Wu, MD Plumbley, W Wang
arXiv preprint arXiv:2306.10359, 2023
Cited by 6, 2023
Mlops spanning whole machine learning life cycle: A survey
F Zhengxin, Y Yi, Z Jingyu, L Yue, M Yuechen, L Qinghua, X Xiwei, W Jeff, ...
arXiv preprint arXiv:2304.07296, 2023
Cited by 6, 2023
Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, and Mark D Plumbley. 2023. AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
H Liu, Q Tian, Y Yuan, X Liu, X Mei
arXiv preprint arXiv:2308.05734 3, 2023
Cited by 6, 2023
Wavjourney: Compositional audio creation with large language models
X Liu, Z Zhu, H Liu, Y Yuan, M Cui, Q Huang, J Liang, Y Cao, Q Kong, ...
arXiv preprint arXiv:2307.14335, 2023
Cited by 5, 2023
Retrieval-augmented text-to-audio generation
Y Yuan, H Liu, X Liu, Q Huang, MD Plumbley, W Wang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 2, 2024
PLDISET: Probabilistic localization and detection of independent sound events with transformers
P Wu, J Zhao, Y Chen, D Berghi, Y Yuan, C Zhu, Y Cao, Y Liu, ...
Detection and Classification of Acoustic Scenes and Events 2023, 2023
Cited by 1, 2023
Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study
Y Yuan, H Liu, J Liang, X Liu, MD Plumbley, W Wang
arXiv preprint arXiv:2303.03857, 2023
2023
HFM++: An Enhanced Holographic Factorization Machine for Recommendation
Z Fang, M Qu, S Zhang, J Zhang, Y Yuan, L Yao, S Chen
Australasian Conference on Data Mining, 72-85, 2021
2021
Articles 1–14