Powerpack: Energy profiling and analysis of high-performance systems and applications R Ge, X Feng, S Song, HC Chang, D Li, KW Cameron IEEE Transactions on Parallel and Distributed Systems 21 (5), 658-671, 2009 | 537 | 2009 |
{Zero-offload}: Democratizing {billion-scale} model training J Ren, S Rajbhandari, RY Aminabadi, O Ruwase, S Yang, M Zhang, D Li, ... 2021 USENIX Annual Technical Conference (USENIX ATC 21), 551-564, 2021 | 320 | 2021 |
Hybrid MPI/OpenMP power-aware computing D Li, BR de Supinski, M Schulz, K Cameron, DS Nikolopoulos 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 195 | 2010 |
Destiny: A tool for modeling emerging 3d nvm and edram caches M Poremba, S Mittal, D Li, JS Vetter, Y Xie 2015 Design, Automation & Test in Europe Conference & Exhibition (DATE …, 2015 | 194 | 2015 |
A survey of architectural approaches for managing embedded DRAM and non-volatile on-chip caches S Mittal, JS Vetter, D Li IEEE Transactions on Parallel and Distributed Systems 26 (6), 1524-1537, 2014 | 194 | 2014 |
Processing-in-memory for energy-efficient neural network training: A heterogeneous approach J Liu, H Zhao, MA Ogleari, D Li, J Zhao 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018 | 145 | 2018 |
Classifying soft error vulnerabilities in extreme-scale scientific applications using a binary instrumentation tool D Li, JS Vetter, W Yu SC'12: Proceedings of the International Conference on High Performance …, 2012 | 123 | 2012 |
Unimem: Runtime data managementon non-volatile memory-based heterogeneous main memory K Wu, Y Huang, D Li Proceedings of the International Conference for High Performance Computing …, 2017 | 119 | 2017 |
Enabling and exploiting flexible task assignment on GPU through SM-centric program transformations B Wu, G Chen, D Li, X Shen, J Vetter Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015 | 116 | 2015 |
Identifying opportunities for byte-addressable non-volatile memory in extreme-scale scientific applications D Li, JS Vetter, G Marin, C McCurdy, C Cira, Z Liu, W Yu 2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012 | 97 | 2012 |
The tradeoffs of fused memory hierarchies in heterogeneous computing architectures KL Spafford, JS Meredith, S Lee, D Li, PC Roth, JS Vetter Proceedings of the 9th conference on Computing Frontiers, 103-112, 2012 | 87 | 2012 |
Strategies for energy-efficient resource management of hybrid programming models D Li, BR De Supinski, M Schulz, DS Nikolopoulos, KW Cameron IEEE Transactions on parallel and distributed Systems 24 (1), 144-157, 2012 | 86 | 2012 |
PORPLE: An extensible optimizer for portable data placement on GPU G Chen, B Wu, D Li, X Shen 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 88-100, 2014 | 82 | 2014 |
Exploring hybrid memory for GPU energy efficiency through software-hardware co-design B Wang, B Wu, D Li, X Shen, W Yu, Y Jiao, JS Vetter Proceedings of the 22nd international conference on Parallel architectures …, 2013 | 81 | 2013 |
Performance analysis and characterization of training deep learning models on mobile device J Liu, J Liu, W Du, D Li 2019 IEEE 25th International Conference on Parallel and Distributed Systems …, 2019 | 68 | 2019 |
Sentinel: Efficient tensor migration and allocation on heterogeneous memory systems for deep learning J Ren, J Luo, K Wu, M Zhang, H Jeon, D Li 2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021 | 66* | 2021 |
Runtime data management on non-volatile memory-based heterogeneous memory for task-parallel programs K Wu, J Ren, D Li SC18: International Conference for High Performance Computing, Networking …, 2018 | 64 | 2018 |
Rethinking algorithm-based fault tolerance with a cooperative software-hardware approach D Li, Z Chen, P Wu, JS Vetter Proceedings of the International Conference on High Performance Computing …, 2013 | 58 | 2013 |
Hm-ann: Efficient billion-point nearest neighbor search on heterogeneous memory J Ren, M Zhang, D Li Advances in Neural Information Processing Systems 33, 10672-10684, 2020 | 57 | 2020 |
Quantitatively modeling application resilience with the data vulnerability factor L Yu, D Li, S Mittal, JS Vetter SC'14: Proceedings of the International Conference for High Performance …, 2014 | 57 | 2014 |