Slim fly: A cost effective low-diameter network topology M Besta, T Hoefler SC'14: Proceedings of the International Conference for High Performance …, 2014 | 200 | 2014 |

Enabling highly-scalable remote memory access programming with MPI-3 one sided R Gerstenberger, M Besta, T Hoefler Proceedings of the International Conference on High Performance Computing …, 2013 | 131 | 2013 |

To push or to pull: On reducing communication and synchronization in graph computations M Besta, M Podstawski, L Groner, E Solomonik, T Hoefler Proceedings of the 26th International Symposium on High-Performance Parallel …, 2017 | 70 | 2017 |

Evaluating the cost of atomic operations on modern architectures H Schweizer, M Besta, T Hoefler 2015 International Conference on Parallel Architecture and Compilation (PACT …, 2015 | 61 | 2015 |

Programming abstractions for data locality A Tate, A Kamil, A Dubey, A Größlinger, B Chamberlain, B Goglin, ... | 39 | 2014 |

Scaling betweenness centrality using communication-efficient sparse matrix multiplication E Solomonik, M Besta, F Vella, T Hoefler Proceedings of the International Conference for High Performance Computing …, 2017 | 37 | 2017 |

Slimsell: A vectorizable graph representation for breadth-first search M Besta, F Marending, E Solomonik, T Hoefler 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017 | 35 | 2017 |

Survey and taxonomy of lossless graph compression and space-efficient graph representations M Besta, T Hoefler arXiv preprint arXiv:1806.01799, 2018 | 34 | 2018 |

A modular benchmarking infrastructure for high-performance and reproducible deep learning T Ben-Nun, M Besta, S Huber, AN Ziogas, D Peter, T Hoefler 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019 | 32 | 2019 |

High-performance distributed rma locks P Schmid, M Besta, T Hoefler Proceedings of the 25th ACM International Symposium on High-Performance …, 2016 | 26 | 2016 |

Fault tolerance for remote memory access programming models M Besta, T Hoefler Proceedings of the 23rd international symposium on High-performance parallel …, 2014 | 25 | 2014 |

Demystifying graph databases: Analysis and taxonomy of data organization, system designs, and graph queries M Besta, E Peter, R Gerstenberger, M Fischer, M Podstawski, C Barthels, ... arXiv preprint arXiv:1910.09017, 2019 | 22 | 2019 |

Transformations of high-level synthesis codes for high-performance computing J de Fine Licht, S Meierhans, T Hoefler arXiv preprint arXiv:1805.08288, 2018 | 22 | 2018 |

Slim noc: A low-diameter on-chip network topology for high energy efficiency and scalability M Besta, SM Hassan, S Yalamanchili, R Ausavarungnirun, O Mutlu, ... ACM SIGPLAN Notices 53 (2), 43-55, 2018 | 22 | 2018 |

Accelerating irregular computations with hardware transactional memory and active messages M Besta, T Hoefler Proceedings of the 24th International Symposium on High-Performance Parallel …, 2015 | 22 | 2015 |

Red-blue pebbling revisited: near optimal parallel matrix-matrix multiplication G Kwasniewski, M Kabić, M Besta, J VandeVondele, R Solcà, T Hoefler Proceedings of the International Conference for High Performance Computing …, 2019 | 18 | 2019 |

Graph processing on fpgas: Taxonomy, survey, challenges M Besta, D Stanojevic, JDF Licht, T Ben-Nun, T Hoefler arXiv preprint arXiv:1903.06697, 2019 | 17 | 2019 |

Active access: A mechanism for high-performance distributed data-centric computations M Besta, T Hoefler Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015 | 17 | 2015 |

Substream-centric maximum matchings on fpga M Besta, M Fischer, T Ben-Nun, J de Fine Licht, T Hoefler Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019 | 16 | 2019 |

Communication-avoiding parallel minimum cuts and connected components L Gianinazzi, P Kalvoda, A De Palma, M Besta, T Hoefler ACM SIGPLAN Notices 53 (1), 219-232, 2018 | 16 | 2018 |