Easy and Efficient Multilevel Checkpointing for Extreme Scale Systems
by Rich Brueckner from High-Performance Computing News Analysis | insideHPC on (#3VFQ7)
Leonardo Bautista from the Barcelona Supercomputing Center gave this talk at PASC18. "Extreme scale supercomputers offer thousands of computing nodes to their users to satisfy their computing needs. As the need for massively parallel computing increases in industry, computing centers are being forced to increase in size and to transition to new computing technologies. In this talk, we will discuss how to guarantee high reliability to high performance applications running in extreme scale supercomputers. In particular, we cover the tools necessary to implement scalable multilevel checkpointing for tightly coupled applications."
The post Easy and Efficient Multilevel Checkpointing for Extreme Scale Systems appeared first on insideHPC.