Method for pausing and restoring MPI (message passing interface) parallel application running
An application program and continuous operation technology, applied in the computer field, can solve the problems of implementation difficulties, limited communication protocol support, and process communication timeout exit, etc., to achieve the effect of convenient control and scheduling
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0031] In order to make the technical means, creative features, goals and effects achieved by the present invention easy to understand, the present invention will be further described below in conjunction with the accompanying drawings and specific examples.
[0032] figure 1 The processing process of this method "pause or resume the operation of MPI parallel application program" is described, and the main process is as follows:
[0033] Step 1. Transform the implementation of the TCP communication protocol in the Linux operating system, and add the control interface function tcp_ioctl_MPI() in the implementation of the TCP communication protocol to query the detailed status of the communication between MPI processes, and then control the communication between processes and process each process communication synchronization problem.
[0034] Step 2. Transform the signal mechanism in the Linux operating system, modify the interface function catch_tstp() of "handling the pause ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 
