Post-training Quantization on Diffusion Models Y Shang, Z Yuan, B Xie, B Wu, Y Yan CVPR 2023, 2023 | 44 | 2023 |
RPTQ: Reorder-based Post-training Quantization for Large Language Models Z Yuan, L Niu, J Liu, W Liu, X Wang, Y Shang, G Sun, Q Wu, J Wu, B Wu arXiv preprint arXiv:2304.01089, 2023 | 35 | 2023 |
Lipschitz Continuity Guided Knowledge Distillation Y Shang, B Duan, Z Zong, L Nie, Y Yan ICCV 2021, 2021 | 28 | 2021 |
Network Binarization via Contrastive Learning Y Shang, X Dan, Z Zong, L Nie, Y Yan ECCV 2022, 2022 | 18 | 2022 |
Lipschitz Continuity Retained Binary Neural Network Y Shang, X Dan, B Duan, Z Zong, L Nie, Y Yan ECCV 2022, 2022 | 14 | 2022 |
ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models Z Yuan*, Y Shang*, Y Song, Q Wu, Y Yan, G Sun arXiv preprint arXiv:2312.05821, 2023 | 5 | 2023 |
LLM Inference Unveiled: Survey and Roofline Model Insights Z Yuan*, Y Shang*, Y Zhou*, Z Dong, C Xue, B Wu, Z Li, Q Gu, YJ Lee, ... arXiv preprint arXiv:2402.16363, 2024 | 4 | 2024 |
Causal-DFQ: Causality Guided Data-free Network Quantization Y Shang, B Xu, G Liu, R Kompella, Y Yan ICCV 2023, 2023 | 2 | 2023 |
BPT: Binary Point Cloud Transformer for Place Recognition Z Hou, Y Shang, T Gao, Y Yan arXiv preprint arXiv:2303.01166, 2023 | 2 | 2023 |
Supplementing Missing Visions via Dialog for Scene Graph Generations Z Zhao, Y Zhu, X Zhu, Y Shang, Y Yan ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Network Specialization via Feature-level Knowledge Distillation G Liu, Y Shang, Y Yao, R Kompella Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 1 | 2023 |
Efficient Multitask Dense Predictor via Binarization Y Shang, D Xu, G Liu, RR Kompella, Y Yan CVPR 2024, 2024 | | 2024 |
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models Y Shang, M Cai, B Xu, YJ Lee, Y Yan arXiv preprint arXiv:2403.15388, 2024 | | 2024 |
FBPT: A Fully Binary Point Transformer Z Hou, Y Shang, Y Yan arXiv preprint arXiv:2403.09998, 2024 | | 2024 |
Online Multi-spectral Neuron Tracing B Duan, Y Shang, D Cai, Y Yan arXiv preprint arXiv:2403.06251, 2024 | | 2024 |
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning H Wang, Y Shang, Z Yuan, J Wu, Y Yan arXiv preprint arXiv:2402.03666, 2024 | | 2024 |
MIM4DD: Mutual Information Maximization for Dataset Distillation Y Shang, Z Yuan, Y Yan NeurIPS 2023, 2023 | | 2023 |
Win The Lottery Ticket Via Fourier Analysis: Frequencies Guided Network Pruning Y Shang, B Duan, Z Zong, L Nie, Y Yan ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | | 2022 |