Follow
Kevin Lin
Kevin Lin
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Deep learning of binary hash codes for fast image retrieval
K Lin, HF Yang, JH Hsiao, CS Chen
IEEE Conference on Computer Vision and Pattern Recognition Workshops, 27-35, 2015
7362015
End-to-end human pose and mesh reconstruction with transformers
K Lin, L Wang, Z Liu
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1954-1963, 2021
5472021
Adversarial ranking for language generation
K Lin, D Li, X He, Z Zhang, MT Sun
Advances in Neural Information Processing Systems (NeurIPS), 3158-3168, 2017
4112017
Learning compact binary descriptors with unsupervised deep neural networks
K Lin, J Lu, CS Chen, J Zhou
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1183-1192, 2016
4112016
Supervised learning of semantics-preserving hash via deep convolutional neural networks
HF Yang, K Lin, CS Chen
IEEE Transactions on Pattern Analysis and Machine Intelligence 40 (2), 437-451, 2018
3862018
GIT: A generative image-to-text transformer for vision and language
J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang
Transactions on Machine Learning Research (TMLR), 2022
3212022
Mesh graphormer
K Lin, L Wang, Z Liu
IEEE/CVF International Conference on Computer Vision (ICCV), 12939-12948, 2021
2502021
The dawn of lmms: Preliminary explorations with gpt-4v (ision)
Z Yang, L Li, K Lin, J Wang, CC Lin, Z Liu, L Wang
arXiv preprint arXiv:2309.17421, 2023
1982023
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ...
arXiv preprint arXiv:2303.11381, 2023
1862023
VIOLET: End-to-end video-language transformers with masked visual-token modeling
TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu
arXiv preprint arXiv:2111.12681, 2021
1732021
SwinBERT: End-to-end transformers with sparse attention for video captioning
K Lin, L Li, CC Lin, F Ahmed, Z Gan, Z Liu, Y Lu, L Wang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 17949 …, 2022
1682022
Mitigating hallucination in large multi-modal models via robust instruction tuning
F Liu, K Lin, L Li, J Wang, Y Yacoob, L Wang
ICLR 2024, 2024
122*2024
Vivo: Visual vocabulary pre-training for novel object captioning
X Hu, X Yin, K Lin, L Zhang, J Gao, L Wang, Z Liu
Proceedings of the AAAI Conference on Artificial Intelligence, 1575-1583, 2021
108*2021
Abandoned object detection via temporal consistency modeling and back-tracing verification for visual surveillance
K Lin, SC Chen, CS Chen, DTD Lin, YP Hung
IEEE Transactions on Information Forensic and Security 10 (7), 1359-1370, 2015
1072015
Mm-vet: Evaluating large multimodal models for integrated capabilities
W Yu, Z Yang, L Li, J Wang, K Lin, Z Liu, X Wang, L Wang
arXiv preprint arXiv:2308.02490, 2023
1052023
Rapid clothing retrieval via deep learning of binary codes and hierarchical search
K Lin, HF Yang, KH Liu, JH Hsiao, CS Chen
ACM International Conference on Multimedia Retrieval (ICMR), 499–502, 2015
882015
Cross-domain complementary learning using pose for multi-person part segmentation
K Lin, L Wang, K Luo, Y Chen, Z Liu, MT Sun
IEEE Transactions on Circuits and Systems for Video Technology 31 (3), 1066 …, 2020
842020
Unsupervised deep learning of compact binary descriptors
K Lin, J Lu, CS Chen, J Zhou, MT Sun
IEEE Transactions on Pattern Analysis and Machine Intelligence 41 (6), 1501-1514, 2019
752019
Lavender: Unifying video-language understanding as masked language modeling
L Li, Z Gan, K Lin, CC Lin, Z Liu, C Liu, L Wang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 23119 …, 2023
572023
Reco: Region-controlled text-to-image generation
Z Yang, J Wang, Z Gan, L Li, K Lin, C Wu, N Duan, Z Liu, C Liu, M Zeng, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14246 …, 2023
542023
The system can't perform the operation now. Try again later.
Articles 1–20