‪Linjun Li‬ - ‪Google Scholar‬

Skapa en profil

Citeras av

	Alla	Sedan 2019
Citat	66	66
h-index	5	5
i10-index	1	1

0

48

24

2023202447 19

Offentlig åtkomst

2 artiklar

0 artiklar

tillgänglig

inte tillgänglig

Enligt krav från finansiärer

Medförfattare

Zhou ZhaoZhejiang UniversityVerifierad e-postadress på zju.edu.cn
Xize Cheng（成曦泽）Zhejiang UniversityVerifierad e-postadress på zju.edu.cn
Wang LinZhejiang UniversityVerifierad e-postadress på zju.edu.cn
Ye WangZhejiang UniversityVerifierad e-postadress på zju.edu.cn
Zehan WangZhejiang UniversityVerifierad e-postadress på zju.edu.cn
Rongjie HuangZhejiang UniversityVerifierad e-postadress på zju.edu.cn

Linjun Li

Linjun Li

Zhejiang University

Verifierad e-postadress på zju.edu.cn

Multi-modal Learning Computer Vision


Titel Sortera efter citat Sortera efter år Sortera efter titel	Citeras av Citeras av	År
Mixspeech: Cross-modality self-learning with audio-visual stream mixup for visual speech translation and recognition X Cheng, T Jin, R Huang, L Li, W Lin, Z Wang, Y Wang, H Liu, A Yin, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	12	2023
Connecting multi-modal contrastive representations Z Wang, Y Zhao, H Huang, J Liu, A Yin, L Tang, L Li, Y Wang, Z Zhang, ... Advances in Neural Information Processing Systems 36, 22099-22114, 2023	9	2023
Multi-granularity relational attention network for audio-visual question answering L Li, T Jin, W Lin, H Jiang, W Pan, J Wang, S Xiao, Y Xia, W Jiang, Z Zhao IEEE Transactions on Circuits and Systems for Video Technology, 2023	7	2023
Opensr: Open-modality speech recognition via maintaining multi-modality alignment X Cheng, T Jin, L Li, W Lin, X Duan, Z Zhao arXiv preprint arXiv:2306.06410, 2023	6	2023
Av-transpeech: Audio-visual robust speech-to-speech translation R Huang, H Liu, X Cheng, Y Ren, L Li, Z Ye, J He, L Zhang, J Liu, X Yin, ... arXiv preprint arXiv:2305.15403, 2023	6	2023
TAVT: Towards Transferable Audio-Visual Text Generation W Lin, T Jin, W Pan, L Li, X Cheng, Y Wang, Z Zhao Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023	5	2023
Distilling coarse-to-fine semantic matching knowledge for weakly supervised 3d visual grounding Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	5	2023
3drp-net: 3d relative position-aware network for 3d visual grounding Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao arXiv preprint arXiv:2307.13363, 2023	4	2023
Contrastive token-wise meta-learning for unseen performer visual temporal-aligned translation L Li, T Jin, X Cheng, Y Wang, W Lin, R Huang, Z Zhao Findings of the Association for Computational Linguistics: ACL 2023, 10993-11007, 2023	4	2023
Weakly-supervised spoken video grounding via semantic interaction learning Y Wang, W Lin, S Zhang, T Jin, L Li, X Cheng, Z Zhao Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023	3	2023
Semantic-conditioned dual adaptation for cross-domain query-based visual segmentation Y Wang, T Jin, W Lin, X Cheng, L Li, Z Zhao Findings of the Association for Computational Linguistics: ACL 2023, 9797-9815, 2023	2	2023
Exploring group video captioning with efficient relational approximation W Lin, T Jin, Y Wang, W Pan, L Li, X Cheng, Z Zhao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	2	2023
Rethinking missing modality learning from a decoding perspective T Jin, X Cheng, L Li, W Lin, Y Wang, Z Zhao Proceedings of the 31st ACM International Conference on Multimedia, 4431-4439, 2023	1	2023
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation X Cheng, R Huang, L Li, T Jin, Z Wang, A Yin, M Li, X Duan, Z Zhao arXiv preprint arXiv:2312.15197, 2023		2023

Systemet kan inte utföra åtgärden just nu. Försök igen senare.

Artiklar 1–14