Linked e-resources
Details
Table of Contents
Part I: General Overview
Fault-Tolerance Techniques for High-Performance Computing
Part II: Technical Contributions
Errors and Faults
Fault-Tolerant MPI
Using Replication for Resilience on Exascale Systems
Energy-Aware Check pointing Strategies.
Fault-Tolerance Techniques for High-Performance Computing
Part II: Technical Contributions
Errors and Faults
Fault-Tolerant MPI
Using Replication for Resilience on Exascale Systems
Energy-Aware Check pointing Strategies.