Pudiannao: A polyvalent machine learning accelerator
D Liu, T Chen, S Liu, J Zhou, S Zhou, O Teman, X Feng, X Zhou, Y Chen
ACM SIGARCH Computer Architecture News 43 (1), 369-381, 2015
Tunao: A high-performance and energy-efficient reconfigurable accelerator for graph processing
J Zhou, S Liu, Q Guo, X Zhou, T Zhi, D Liu, C Wang, X Zhou, Y Chen, ...
2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2017
Journal of Computer Research and Development 50 (10), 2212-2227, 2013
Deterministic replay using global clock
Y Chen, T Chen, L Li, R Wu, D Liu, W Hu
ACM Transactions on Architecture and Code Optimization (TACO) 10 (1), 1-28, 2013
An FFT Performance Model for Optimizing General-Purpose Processor Architecture
L Li, YJ Chen, DF Liu, C Qian, WW Hu
Journal of Computer Science and Technology 26 (5), 875-889, 2011
A trace compression method for on-chip networks of multi-core processors
C Qian, D Liu, Y Chen
Chinese High Technology Letters 21 (3), 254-260, 2011
Performance prediction for reconfigurable processor
D Liu, Q Guo, T Chen, L Li, Y Chen
2012 IEEE 14th International Conference on High Performance Computing and …, 2012
DLS: Directoryless Shared Last-level Cache
D Liu, Y Chen, Q Guo, T Chen, L Li, Q Dong, W Hu
arXiv preprint arXiv:1206.4753, 2012
ParaML: A Polyvalent Multi-core Accelerator for Machine Learning
S Zhou, Q Guo, Z Du, D Liu, T Chen, L Li, S Liu, J Zhou, O Teman, X Feng, ...
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2019
