Lei Gong
School of Computer Science and Technology, University of Science and Technology of China
Verified email at ustc.edu.cn
Title
Cited by
Year
DLAU: A scalable deep learning accelerator unit on FPGA
C Wang, L Gong, Q Yu, X Li, Y Xie, X Zhou
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2016
Cited by: 417; Year: 2016
MALOC: A fully pipelined FPGA accelerator for convolutional neural networks with all layers mapped on chip
L Gong, C Wang, X Li, H Chen, X Zhou
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2018
Cited by: 114; Year: 2018
WinoNN: Optimizing FPGA-based convolutional neural network accelerators using sparse Winograd algorithm
X Wang, C Wang, J Cao, L Gong, X Zhou
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2020
Cited by: 39; Year: 2020
A power-efficient accelerator based on FPGAs for LSTM network
Y Zhang, C Wang, L Gong, Y Lu, F Sun, C Xu, X Li, X Zhou
2017 IEEE International Conference on Cluster Computing (CLUSTER), 629-630, 2017
Cited by: 36; Year: 2017
A ubiquitous machine learning accelerator with automatic parallelization on FPGA
C Wang, L Gong, X Li, X Zhou
IEEE Transactions on Parallel and Distributed Systems 31 (10), 2346-2359, 2020
Cited by: 33; Year: 2020
A high-performance accelerator for large-scale convolutional neural networks
F Sun, C Wang, L Gong, C Xu, Y Zhang, Y Lu, X Li, X Zhou
2017 IEEE International Symposium on Parallel and Distributed Processing …, 2017
Cited by: 26; Year: 2017
ViA: A novel vision-transformer accelerator based on FPGA
T Wang, L Gong, C Wang, Y Yang, Y Gao, X Zhou, H Chen
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2022
Cited by: 23; Year: 2022
Reconfigurable hardware accelerators: Opportunities, trends, and challenges
C Wang, W Lou, L Gong, L Jin, L Tan, Y Hu, X Li, X Zhou
arXiv preprint arXiv:1712.04771, 2017
Cited by: 22; Year: 2017
Improving HW/SW adaptability for accelerating CNNs on FPGAs through a dynamic/static co-reconfiguration approach
L Gong, C Wang, X Li, X Zhou
IEEE Transactions on Parallel and Distributed Systems 32 (7), 1854-1865, 2020
Cited by: 19; Year: 2020
Implementation and optimization of the accelerator based on FPGA hardware for LSTM network
Y Zhang, C Wang, L Gong, Y Lu, F Sun, C Xu, X Li, X Zhou
2017 IEEE International Symposium on Parallel and Distributed Processing …, 2017
Cited by: 19; Year: 2017
A high-performance FPGA accelerator for sparse neural networks: work-in-progress
Y Lu, L Gong, C Xu, F Sun, Y Zhang, C Wang, X Zhou
Proceedings of the 2017 International Conference on Compilers, Architectures …, 2017
Cited by: 19; Year: 2017
WooKong: A ubiquitous accelerator for recommendation algorithms with custom instruction sets on FPGA
C Wang, L Gong, X Ma, X Li, X Zhou
IEEE Transactions on Computers 69 (7), 1071-1082, 2020
Cited by: 18; Year: 2020
Fpnet: Customized convolutional neural network for FPGA platforms
Y Yang, C Wang, L Gong, X Zhou
2019 International Conference on Field-Programmable Technology (ICFPT), 399-402, 2019
Cited by: 18; Year: 2019
RV-CNN: Flexible and efficient instruction set for CNNs based on RISC-V processors
W Lou, C Wang, L Gong, X Zhou
Advanced Parallel Processing Technologies: 13th International Symposium …, 2019
Cited by: 16; Year: 2019
OctCNN: A high throughput FPGA accelerator for CNNs using octave convolution algorithm
W Lou, L Gong, C Wang, Z Du, X Zhou
IEEE Transactions on Computers 71 (8), 1847-1859, 2021
Cited by: 14; Year: 2021
An FPGA-based accelerator for clustering algorithms with custom instructions
C Wang, L Gong, F Jia, X Zhou
IEEE Transactions on Computers 70 (5), 725-732, 2020
Cited by: 14; Year: 2020
SOLAR: Services-oriented deep learning architectures-deep learning as a service
C Wang, L Gong, X Li, Q Yu, A Wang, P Hung, X Zhou
IEEE Transactions on Services Computing 14 (1), 262-273, 2017
Cited by: 14; Year: 2017
Work-in-progress: a power-efficient and high performance FPGA accelerator for convolutional neural networks
L Gong, C Wang, X Li, H Chen, X Zhou
2017 International Conference on Hardware/Software Codesign and System …, 2017
Cited by: 12; Year: 2017
Domino: Graph processing services on energy-efficient hardware accelerator
C Xu, C Wang, L Gong, L Jin, X Li, X Zhou
2018 IEEE International Conference on Web Services (ICWS), 274-281, 2018
Cited by: 11; Year: 2018
SparseNN: A performance-efficient accelerator for large-scale sparse neural networks
Y Lu, C Wang, L Gong, X Zhou
International Journal of Parallel Programming 46 (4), 648-659, 2018
Cited by: 8; Year: 2018
Articles 1–20