Zihao Ye
PhD of Computer Science, University of Washington
Verified email at cs.washington.edu
Title
Cited by
Year
Deep Graph Library: A Graph-Centric, Highly-Performant Package for Graph Neural Networks
M Wang, D Zheng, Z Ye, Q Gan, M Li, J Zhou, C Ma, L Yu, Y Gai, T Xiao, ...
arXiv preprint arXiv:1909.01315
Cited by: 2121*
DGL-KE: Training knowledge graph embeddings at scale
D Zheng, X Song, C Ma, Z Tan, Z Ye, J Dong, H Xiong, Z Zhang, ...
Proceedings of the 43rd International ACM SIGIR Conference on Research and …, 2020
Cited by: 205; Year: 2020
FeatGraph: A flexible and efficient backend for graph neural network systems
Y Hu, Z Ye, M Wang, J Yu, D Zheng, M Li, Z Zhang, Z Zhang, Y Wang
SC20: International Conference for High Performance Computing, Networking …, 2020
Cited by: 97; Year: 2020
BP-Transformer: Modelling Long-Range Context via Binary Partitioning
Z Ye, Q Guo, Q Gan, X Qiu, Z Zhang
arXiv preprint arXiv:1911.04070, 2019
Cited by: 82; Year: 2019
SparseTIR: Composable abstractions for sparse compilation in deep learning
Z Ye, R Lai, J Shao, T Chen, L Ceze
Proceedings of the 28th ACM International Conference on Architectural …, 2023
Cited by: 77; Year: 2023
Atom: Low-bit quantization for efficient and accurate LLM serving
Y Zhao, CY Lin, K Zhu, Z Ye, L Chen, S Zheng, L Ceze, A Krishnamurthy, ...
Proceedings of Machine Learning and Systems 6, 196-209, 2024
Cited by: 73; Year: 2024
TensorIR: An abstraction for automatic tensorized program optimization
S Feng, B Hou, H Jin, W Lin, J Shao, R Lai, Z Ye, L Zheng, CH Yu, Y Yu, ...
Proceedings of the 28th ACM International Conference on Architectural …, 2023
Cited by: 64; Year: 2023
Punica: Multi-tenant LoRA serving
L Chen, Z Ye, Y Wu, D Zhuo, L Ceze, A Krishnamurthy
Proceedings of Machine Learning and Systems 6, 1-13, 2024
Cited by: 34; Year: 2024
Graphiler: Optimizing Graph Neural Networks with Message Passing Data Flow Graph
Z Xie, M Wang, Z Ye, Z Zhang, R Fan
Proceedings of Machine Learning and Systems 4, 515-528, 2022
Cited by: 25; Year: 2022
Relax: Composable Abstractions for End-to-End Dynamic Machine Learning
R Lai, J Shao, S Feng, SS Lyubomirsky, B Hou, W Lin, Z Ye, H Jin, Y Jin, ...
arXiv preprint arXiv:2311.02103, 2023
Cited by: 8; Year: 2023
NanoFlow: Towards Optimal Large Language Model Serving Throughput
K Zhu, Y Zhao, L Zhao, G Zuo, Y Gu, D Xie, Y Gao, Q Xu, T Tang, Z Ye, ...
arXiv preprint arXiv:2408.12757, 2024
Cited by: 3; Year: 2024
MagicPIG: LSH Sampling for Efficient LLM Generation
Z Chen, R Sadhukhan, Z Ye, Y Zhou, J Zhang, N Nolte, Y Tian, M Douze, ...
arXiv preprint arXiv:2410.16179, 2024
Cited by: 1; Year: 2024
vMCU: Coordinated Memory Management and Kernel Optimization for DNN Inference on MCUs
S Zheng, R Chen, M Li, Z Ye, L Ceze, Y Liang
Proceedings of Machine Learning and Systems 6, 452-464, 2024
Cited by: 1; Year: 2024