Mixed-Precision Neural Network Quantization via Learned Layer-Wise Importance C Tang, K Ouyang, Z Wang, Y Zhu, W Ji, Y Wang, W Zhu Proc. of ECCV, 2022 | 34 | 2022 |
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices C Tang, LL Zhang, H Jiang, J Xu, T Cao, Q Zhang, Y Yang, Z Wang, ... Proc. of ICCV, 2023 | 10 | 2023 |
Arbitrary Bit-width Network: A Joint Layer-Wise Quantization and Adaptive Inference Approach C Tang, H Zhai, K Ouyang, Z Wang, Y Zhu, W Zhu Proc. of ACM MM, 2022 | 9 | 2022 |
SEAM: Searching Transferable Mixed-Precision Quantization Policy through Large Margin Regularization C Tang, K Ouyang, Z Chai, Y Bai, Y Meng, Z Wang, W Zhu Proc. of ACM MM, 2023 | 5 | 2023 |
Social-aware Sparse Attention Network for Session-based Social Recommendation K Ouyang, X Xu, C Tang, W Chen, H Zheng Findings of EMNLP, 2022 | 4 | 2022 |
Click-Aware Structure Transfer with Sample Weight Assignment for Post-Click Conversion Rate Estimation K Ouyang, W Zheng, C Tang, X Xiao, HT Zheng Proc. of ECML-PKDD, 2023 | 3 | 2023 |
Retraining-free Model Quantization via One-Shot Weight-Coupling Learning C Tang, Y Meng, J Jiang, S Xie, R Lu, X Ma, Z Wang, W Zhu Proc. of CVPR, 2024 | 1 | 2024 |
TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models H Sun, C Tang, Z Wang, Y Meng, X Ma, W Zhu arXiv preprint arXiv:2404.09532, 2024 | | 2024 |
Investigating the Impact of Quantization on Adversarial Robustness Q Li, Y Meng, C Tang, J Jiang, Z Wang arXiv preprint arXiv:2404.05639, 2024 | | 2024 |
DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing R Lu, Y Jiang, Y Mao, C Tang, B Chen, L Cui, Z Wang arXiv preprint arXiv:2311.07324, 2023 | | 2023 |
Knowledge Soft Integration for Multimodal Recommendation K Ouyang, C Tang, W Zheng, X Xie, X Xiao, J Dong, HT Zheng, Z Wang arXiv preprint arXiv:2305.07419, 2023 | | 2023 |
AdaConfigure: Reinforcement Learning-Based Adaptive Configuration for Video Analytics Services Z He, Y Wang, C Tang, Z Wang, W Zhu, C Guo, Z Chen International Conference on Multimedia Modeling (MMM), 2022, 2022 | | 2022 |
An Adaptive Logarithm Quantization Method for DNN Compression Y Wang, Z He, C Tang, Z Wang, W Zhu International Conference on Neural Information Processing (ICONIP), 2021, 2021 | | 2021 |
Appendix for Retraining-free Model Quantization via One-Shot Weight-Coupling Learning C Tang, Y Meng, J Jiang, S Xie, R Lu, X Ma, Z Wang, W Zhu, AE Setups | | |
Supplementary Material for Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance C Tang, K Ouyang, Z Wang, Y Zhu, W Ji, Y Wang, W Zhu | | |