Thomas Herault
Title | Cited by | Year
MPICH-V: Toward a scalable fault tolerant MPI for volatile nodes
G Bosilca, A Bouteiller, F Cappello, S Djilali, G Fedak, C Germain, ...
SC'02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, 29-29, 2002
Cited by: 425, Year: 2002
DAGuE: A generic distributed DAG engine for high performance computing
G Bosilca, A Bouteiller, A Danalis, T Herault, P Lemarinier, J Dongarra
Parallel Computing 38 (1-2), 37-51, 2012
Cited by: 358, Year: 2012
Computing on large-scale distributed systems: XtremWeb architecture, programming models, security, tests and convergence with grid
F Cappello, S Djilali, G Fedak, T Herault, F Magniette, V Néri, ...
Future Generation Computer Systems 21 (3), 417-437, 2005
Cited by: 336, Year: 2005
Approximate probabilistic model checking
T Hérault, R Lassaigne, F Magniette, S Peyronnet
International Workshop on Verification, Model Checking, and Abstract …, 2004
Cited by: 313, Year: 2004
MPICH-V2: a fault tolerant MPI for volatile nodes based on pessimistic sender based message logging
A Bouteiller, F Cappello, T Herault, G Krawezik, P Lemarinier, F Magniette
Proceedings of the 2003 ACM/IEEE conference on Supercomputing, 25, 2003
Cited by: 261, Year: 2003
MPICH-V project: A multiprotocol automatic fault-tolerant MPI
A Bouteiller, T Herault, G Krawezik, P Lemarinier, F Cappello
The International Journal of High Performance Computing Applications 20 (3 …, 2006
Cited by: 187, Year: 2006
PaRSEC: Exploiting heterogeneity to enhance scalability
G Bosilca, A Bouteiller, A Danalis, M Faverge, T Hérault, JJ Dongarra
Computing in Science & Engineering 15 (6), 36-45, 2013
Cited by: 175, Year: 2013
Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA
G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ...
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
Cited by: 151*, Year: 2011
Post-failure recovery of MPI communication capability: Design and rationale
W Bland, A Bouteiller, T Herault, G Bosilca, J Dongarra
The International Journal of High Performance Computing Applications 27 (3 …, 2013
Cited by: 148, Year: 2013
Algorithm-based fault tolerance for dense matrix factorizations
P Du, A Bouteiller, G Bosilca, T Herault, J Dongarra
ACM SIGPLAN Notices 47 (8), 225-234, 2012
Cited by: 144, Year: 2012
Blocking vs. non-blocking coordinated checkpointing for large-scale fault tolerant MPI
C Coti, T Herault, P Lemarinier, L Pilard, A Rezmerita, E Rodriguez, ...
SC'06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, 18-18, 2006
Cited by: 109, Year: 2006
An evaluation of user-level failure mitigation support in MPI
W Bland, A Bouteiller, T Herault, J Hursey, G Bosilca, JJ Dongarra
European MPI Users' Group Meeting, 193-203, 2012
Cited by: 102, Year: 2012
Improved message logging versus improved coordinated checkpointing for fault tolerant MPI
P Lemarinier, A Bouteiller, T Herault, G Krawezik, F Cappello
2004 IEEE International Conference on Cluster Computing (IEEE Cat. No …, 2004
Cited by: 98, Year: 2004
Fault-tolerance techniques for high-performance computing
T Herault, Y Robert
Springer, 2015
Cited by: 96, Year: 2015
Blocking vs. non-blocking coordinated checkpointing for large-scale fault tolerant MPI protocols
D Buntinas, C Coti, T Herault, P Lemarinier, L Pilard, A Rezmerita, ...
Future Generation Computer Systems 24 (1), 73-84, 2008
Cited by: 83, Year: 2008
Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics): Preface
D Ünay, Z Çataltepe, S Aksoy
Lecture Notes in Computer Science (including subseries Lecture Notes in …, 2010
Cited by: 78, Year: 2010
Unified model for assessing checkpointing protocols at extreme‐scale
G Bosilca, A Bouteiller, E Brunet, F Cappello, J Dongarra, A Guermouche, ...
Concurrency and Computation: Practice and Experience 26 (17), 2772-2791, 2014
Cited by: 71, Year: 2014
Probabilistic model checking of the CSMA/CD protocol using PRISM and APMC
M Duflot, L Fribourg, T Herault, R Lassaigne, F Magniette, S Messika, ...
Electronic Notes in Theoretical Computer Science 128 (6), 195-214, 2005
Cited by: 62, Year: 2005
Hierarchical QR factorization algorithms for multi-core clusters
J Dongarra, M Faverge, T Herault, M Jacquelin, J Langou, Y Robert
Parallel Computing 39 (4-5), 212-232, 2013
Cited by: 53, Year: 2013
Correlated set coordination in fault tolerant message logging protocols
A Bouteiller, T Herault, G Bosilca, JJ Dongarra
European Conference on Parallel Processing, 51-64, 2011
Cited by: 49, Year: 2011