OMPI_CRCP - Open MPI MCA Checkpoint/Restart Coordination Protocol (CRCP)
Framework: Overview of Open MPI's CRCP framework, and selected modules.
The CRCP Framework is used by Open MPI for the encapsulation of various
Checkpoint/Restart Coordination Protocols (e.g., Coordinated, Uncoordinated,
Message/Communication Induced, ...).
For a process to use the Open MPI CRCP components, it must adhere to a
few programmatic requirements.
First, the program must call MPI_INIT early in its execution.
Second, the program must call MPI_FINALIZE before it terminates.
A user may checkpoint a parallel application with the ompi-checkpoint(1)
command and restart it with the ompi-restart(1) command.
Open MPI currently ships with one CRCP component: coord.
The following MCA parameters apply to all components:
- Set the verbosity level for all components. Default is 0, or silent except
on error.
The coord component implements a Coordinated Checkpoint/Restart
Coordination Protocol similar to the one implemented in LAM/MPI.
The coord component has the following MCA parameters:
- The component's priority to use when selecting the most appropriate
component for a run.
- Set the verbosity level for this component. Default is 0, or silent except
on error.
The none component simply selects no CRCP component. All of the CRCP
function calls return immediately with OMPI_SUCCESS.
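The behavior of returning success immediately can be pictured as a module
whose hooks are all no-ops. The following is only a minimal sketch in C, not
Open MPI's actual code: the type sketch_crcp_module_t, the hook names
before_checkpoint and after_restart, and the constant SKETCH_SUCCESS (a
stand-in for OMPI_SUCCESS) are all hypothetical names invented for
illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for OMPI_SUCCESS; not taken from Open MPI headers. */
#define SKETCH_SUCCESS 0

/* Hypothetical table of coordination hooks; the real CRCP interface differs. */
typedef struct {
    int (*before_checkpoint)(void);
    int (*after_restart)(void);
} sketch_crcp_module_t;

/* A hook that does no coordination work and reports success at once. */
int sketch_none_noop(void)
{
    return SKETCH_SUCCESS;
}

/* A "none"-style module: every hook points at the same no-op. */
const sketch_crcp_module_t sketch_none_module = {
    sketch_none_noop,
    sketch_none_noop
};

/* Call each hook in turn and stop at the first failure. */
int sketch_run_hooks(const sketch_crcp_module_t *module)
{
    int rc;

    assert(module != NULL);
    rc = module->before_checkpoint();
    if (rc != SKETCH_SUCCESS) {
        return rc;
    }
    return module->after_restart();
}
```

Calling sketch_run_hooks(&sketch_none_module) returns SKETCH_SUCCESS without
doing any work, so a caller does not need a special case for the situation
where no coordination protocol is active.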
This component is the last component to be selected by default. This means
that if another component is available, and the none component was not
explicitly requested, then Open MPI will attempt to activate all of the
available components before falling back to this component.
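The fallback rule described above can be sketched as a small selection
routine. This is an illustrative sketch in C under stated assumptions, not the
MCA selection logic itself: the type sketch_component_t, the function
sketch_select, and the priority values used to exercise it are hypothetical.
It assumes that an explicit request is honored only if the named component is
available, and that otherwise the highest-priority available component other
than none is preferred.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical component descriptor; not an Open MPI type. */
typedef struct {
    const char *name;   /* component name, e.g. "coord" or "none" */
    int priority;       /* higher values are preferred */
    bool available;     /* whether the component could be activated */
} sketch_component_t;

/*
 * Pick a component from the list.
 *
 * If `requested` is non-NULL, only an available component with exactly
 * that name is returned. Otherwise the highest-priority available
 * component other than "none" is returned, and "none" is used only when
 * nothing else is available. Returns NULL if no component qualifies.
 */
const sketch_component_t *
sketch_select(const sketch_component_t *components, size_t count,
              const char *requested)
{
    const sketch_component_t *best = NULL;
    const sketch_component_t *fallback = NULL;
    size_t i;

    assert(components != NULL || count == 0);

    for (i = 0; i < count; ++i) {
        const sketch_component_t *c = &components[i];

        if (!c->available) {
            continue;
        }
        if (requested != NULL) {
            /* Explicit request: accept only an exact name match. */
            if (strcmp(c->name, requested) == 0) {
                return c;
            }
            continue;
        }
        if (strcmp(c->name, "none") == 0) {
            /* Remember "none", but keep looking for a real component. */
            fallback = c;
            continue;
        }
        if (best == NULL || c->priority > best->priority) {
            best = c;
        }
    }
    return best != NULL ? best : fallback;
}
```

With a list containing an available coord and an available none, passing NULL
for `requested` yields coord, while passing "none" yields none. If coord is
marked unavailable, the default selection falls back to none.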
ompi-checkpoint(1), ompi-restart(1), opal-checkpoint(1), opal-restart(1),
orte_snapc(7), orte_filem(7), opal_crs(7)