A class of parallel tiled linear algebra algorithms for multicore architectures A Buttari, J Langou, J Kurzak, J Dongarra Parallel Computing 35 (1), 38-53, 2009 | 579 | 2009 |

Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects E Agullo, J Demmel, J Dongarra, B Hadri, J Kurzak, J Langou, H Ltaief, ... Journal of Physics: Conference Series 180 (1), 012037, 2009 | 465 | 2009 |

Communication-optimal parallel and sequential QR and LU factorizations J Demmel, L Grigori, M Hoemmen, J Langou SIAM Journal on Scientific Computing 34 (1), A206-A239, 2012 | 376 | 2012 |

Parallel tiled QR factorization for multicore architectures A Buttari, J Langou, J Kurzak, J Dongarra Concurrency and Computation: Practice and Experience 20 (13), 1573-1590, 2008 | 240 | 2008 |

Algorithm-based fault tolerance applied to high performance computing G Bosilca, R Delmas, J Dongarra, J Langou Journal of Parallel and Distributed Computing 69 (4), 410-416, 2009 | 211 | 2009 |

Algorithm 842: A set of GMRES routines for real and complex arithmetics on high performance computers V Frayssé, L Giraud, S Gratton, J Langou ACM Transactions on Mathematical Software (TOMS) 31 (2), 228-238, 2005 | 181 | 2005 |

Tiled QR factorization algorithms H Bouwmeester, M Jacquelin, J Langou, Y Robert SC'11: Proceedings of 2011 International Conference for High Performance …, 2011 | 173 | 2011 |

Accelerating scientific computations with mixed precision algorithms M Baboulin, A Buttari, J Dongarra, J Kurzak, J Langou, J Langou, ... Computer Physics Communications 180 (12), 2526-2533, 2009 | 164 | 2009 |

Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ... 2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011 | 162* | 2011 |

Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems) J Langou, J Langou, P Luszczek, J Kurzak, A Buttari, J Dongarra SC'06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, 50-50, 2006 | 154 | 2006 |

The impact of multicore on math software A Buttari, J Dongarra, J Kurzak, J Langou, P Luszczek, S Tomov International Workshop on Applied Parallel Computing, 1-10, 2006 | 151 | 2006 |

The loss of orthogonality in the Gram-Schmidt orthogonalization process L Giraud, J Langou, M Rozloznik Computers & Mathematics with Applications 50 (7), 1069-1075, 2005 | 144 | 2005 |

Mixed precision iterative refinement techniques for the solution of dense linear systems A Buttari, J Dongarra, J Langou, J Langou, P Luszczek, J Kurzak The International Journal of High Performance Computing Applications 21 (4 …, 2007 | 127 | 2007 |

Fault tolerant high performance computing by a coding approach Z Chen, GE Fagg, E Gabriel, J Langou, T Angskun, G Bosilca, J Dongarra Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of …, 2005 | 127 | 2005 |

Rounding error analysis of the classical Gram-Schmidt orthogonalization process L Giraud, J Langou, M Rozložník, J van den Eshof Numerische Mathematik 101 (1), 87-100, 2005 | 126 | 2005 |

Handbook of parallel computing: models, algorithms and applications S Rajasekaran, J Reif CRC press, 2007 | 117* | 2007 |

LU factorization for accelerator-based systems E Agullo, C Augonnet, J Dongarra, M Faverge, J Langou, H Ltaief, ... 2011 9th IEEE/ACS International Conference on Computer Systems and …, 2011 | 77 | 2011 |

A set of GMRES routines for real and complex arithmetics V Frayssé, L Giraud, S Gratton, J Langou URL http://www. cerfacs. fr/algor/Softs/GMRES/index. html, 1997 | 60 | 1997 |

Hierarchical QR factorization algorithms for multi-core clusters J Dongarra, M Faverge, T Herault, M Jacquelin, J Langou, Y Robert Parallel Computing 39 (4-5), 212-232, 2013 | 55 | 2013 |

Plasma users guide E Agullo, J Dongarra, B Hadri, J Kurzak, J Langou, J Langou, H Ltaief, ... Technical report, ICL, UTK, 2009 | 55 | 2009 |