Toward a Transparent, Checkpointable Fault-Tolerant Message Passing Interface for HPC Systems
(2019-12-09)
With each successive generation of large-scale high-performance computing (HPC) systems, faults and associated failures are becoming more frequent. Long-running applications in such systems require efficient fault-tolerance ...