Mayank Daga
Mayank Daga
Director, Radeon Technology Group, AMD
Verifierad e-postadress på amd.com - Startsida
TitelCiteras avÅr
On the efficacy of a fused CPU+ GPU processor (or APU) for parallel computing
M Daga, AM Aji, W Feng
2011 Symposium on Application Accelerators in High-Performance Computing …, 2011
1442011
Efficient sparse matrix-vector multiplication on GPUs using the CSR storage format
JL Greathouse, M Daga
Proceedings of the International Conference for High Performance Computing …, 2014
1222014
Exploiting coarse-grained parallelism in B+ tree searches on an APU
M Daga, M Nutter
2012 SC Companion: High Performance Computing, Networking Storage and …, 2012
362012
Architecture-aware mapping and optimization on a 1600-core gpu
M Daga, T Scogland, W Feng
2011 IEEE 17th International Conference on Parallel and Distributed Systems …, 2011
362011
Efficient breadth-first search on a heterogeneous processor
M Daga, M Nutter, M Meswani
2014 IEEE International Conference on Big Data (Big Data), 373-382, 2014
252014
Structural agnostic SpMV: Adapting CSR-adaptive for irregular matrices
M Daga, JL Greathouse
2015 IEEE 22nd International conference on high performance computing (HiPC …, 2015
242015
Bounding the effect of partition camping in GPU kernels
AM Aji, M Daga, W Feng
Proceedings of the 8th ACM International Conference on Computing Frontiers, 27, 2011
232011
An n log n generalized Born approximation
R Anandakrishnan, M Daga, AV Onufriev
Journal of Chemical Theory and Computation 7 (3), 544-559, 2011
222011
Exploring parallel programming models for heterogeneous computing systems
M Daga, ZS Tschirhart, C Freitag
2015 IEEE International Symposium on Workload Characterization, 98-107, 2015
152015
Implementing directed acyclic graphs with the heterogeneous system architecture
S Puthoor, AM Aji, S Che, M Daga, W Wu, BM Beckmann, G Rodgers
Proceedings of the 9th Annual Workshop on General Purpose Processing using …, 2016
112016
Towards accelerating molecular modeling via multi-scale approximation on a GPU
M Daga, W Feng, T Scogland
2011 IEEE 1st International Conference on Computational Advances in Bio and …, 2011
112011
Efficient sparse matrix-vector multiplication on parallel processors
M Daga, JL Greathouse
US Patent 9,697,176, 2017
72017
clSPARSE: A Vendor-Optimized Open-Source Sparse BLAS Library
JL Greathouse, K Knox, J Poła, K Varaganti, M Daga
Proceedings of the 4th International Workshop on OpenCL, 7, 2016
72016
Architecture-Aware Optimization on a 1600-core Graphics Processor
M Daga, TRW Scogland, W Feng
52011
CampProf: a visual performance analysis tool for memory bound GPU kernels
AM Aji, M Daga, W Feng
Department of Computer Science, Virginia Polytechnic Institute & State …, 2010
52010
Multi-dimensional characterization of electrostatic surface potential computation on graphics processors
M Daga, W Feng
BMC bioinformatics 13 (5), S4, 2012
42012
Architecture-Aware Mapping and Optimization on Heterogeneous Computing Systems
M Daga
Virginia Tech, 2011
22011
Multiscale Approximation with Graphical Processing Units for Multiplicative Speedup in Molecular Dynamics
R Anandakrishnan, M Daga, A Onufriev, W Feng
Proceedings of the 7th ACM International Conference on Bioinformatics …, 2016
12016
On the performance, energy, and power of data-access methods in heterogeneous computing systems
R Kalidas, M Daga, K Krommydas, W Feng
2015 IEEE International Parallel and Distributed Processing Symposium …, 2015
12015
MIOpen: An Open Source Library For Deep Learning Primitives
J Khan, P Fultz, A Tamazov, D Lowell, C Liu, M Melesse, ...
arXiv preprint arXiv:1910.00078, 2019
2019
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20