Fault tolerance management method for a multi-core operating system

An operating system management method applied to multi-core processors in embedded systems. It addresses the problem of the system being unable to perform fault-tolerant management, avoids the uncertainty of asynchronous events and data access conflicts, improves system efficiency, and reduces algorithm overhead.

Active Publication Date: 2015-12-30
BEIJING INST OF CONTROL ENG

AI Technical Summary

Problems solved by technology

To avoid the problem that the system cannot perform fault-tolerant management due to the failure of the processor c

Embodiment Construction

[0019] The invention provides a fault-tolerant management method for a multi-core operating system based on shared memory. The method lets the processor cores monitor each other's states through an inter-core fault detection matrix. When a processor core fails, its work (including tasks and interrupts) is migrated to a healthy processor core and continues to run. The method can detect a failed processor core quickly, and it keeps the system operating normally even in the extreme case where only one core still works. The steps are as follows:

[0020] (1) Construct a global task list and an n×n inter-core fault detection matrix (n is the number of processor cores) in the system.

[0021] (2) Run a scheduler instance on each processor core. The basic function of each scheduler instance is to apply the task scheduling algorithm whenever an interrupt occurs or a task on the local processor core changes state: pick a ready task from the ...
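Steps (1) and (2) can be pictured with a small Python model. This is only a sketch under stated assumptions, not the patented implementation: the names `Task`, `make_fault_matrix`, `pick_ready_task` and `release_tasks_of` are hypothetical, fixed-priority selection stands in for the task scheduling algorithm (which the text above does not specify), and the lock that would protect the shared structures on real hardware is omitted.

```python
from dataclasses import dataclass
from typing import List, Optional

READY, RUNNING = "ready", "running"


@dataclass
class Task:
    name: str
    priority: int                 # assumption: lower number = more urgent
    state: str = READY
    core: Optional[int] = None    # core currently running the task, if any


def make_fault_matrix(n: int) -> List[List[bool]]:
    """Step (1): an n x n inter-core fault detection matrix, all FALSE."""
    return [[False] * n for _ in range(n)]


def pick_ready_task(global_tasks: List[Task], core: int) -> Optional[Task]:
    """Step (2): the scheduler instance on `core` takes one ready task
    from the single global task list (assumed policy: highest priority)."""
    ready = [t for t in global_tasks if t.state == READY]
    if not ready:
        return None
    task = min(ready, key=lambda t: t.priority)
    task.state, task.core = RUNNING, core
    return task


def release_tasks_of(global_tasks: List[Task], failed_core: int) -> None:
    """Return the failed core's tasks to the ready state so that any
    healthy core's scheduler instance can pick them up again."""
    for t in global_tasks:
        if t.core == failed_core and t.state == RUNNING:
            t.state, t.core = READY, None
```

Because every scheduler instance draws from the same global list, migrating work away from a failed core reduces, in this model, to marking its tasks ready again; the next scheduling event on any healthy core picks them up.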

Abstract

The invention provides a fault-tolerance management method for a multi-core operating system. The method constructs an n×n inter-core fault detection matrix, and each processor core uses the matrix to check periodically whether any other processor core in the system has failed. Fault detection on each processor core comprises: at the beginning of each timed monitoring period, setting the elements in the line k of the matrix corresponding to the processor core to TRUE; at the end of each timed monitoring period, checking the elements in the line J of the matrix corresponding to the processor core and, if an element e(J, k) is not TRUE, judging that the processor core k corresponding to that element has failed; and finally resetting all the elements in line J. Each processor core is thus able to detect faults in the other cores of the processor, which greatly improves fault tolerance.
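The monitoring cycle described in the abstract can be simulated in a few lines of Python. The sketch rests on one interpretation that the translated text does not make explicit: each core writes TRUE into its own column (one cell per observer) and later reads and clears its own row. The class and function names are hypothetical, and running all heartbeats before all checks is a simplification of real per-core timers.

```python
from typing import Dict, Iterable, List


class FaultDetectionMatrix:
    """n x n matrix e; e[j][k] is TRUE when core k has reported to core j."""

    def __init__(self, n: int) -> None:
        self.n = n
        self.e = [[False] * n for _ in range(n)]

    def heartbeat(self, core: int) -> None:
        # Start of period: the core marks itself alive for every observer
        # by setting its own column (assumed reading of "line k").
        for row in range(self.n):
            self.e[row][core] = True

    def check(self, core: int) -> List[int]:
        # End of period: the core scans its own row; any other core whose
        # cell is not TRUE is judged to have failed. The row is then reset.
        failed = [k for k in range(self.n)
                  if k != core and not self.e[core][k]]
        self.e[core] = [False] * self.n
        return failed


def run_period(matrix: FaultDetectionMatrix,
               alive: Iterable[int]) -> Dict[int, List[int]]:
    """Simulate one monitoring period for the cores that are still running."""
    cores = sorted(alive)
    for c in cores:
        matrix.heartbeat(c)
    return {c: matrix.check(c) for c in cores}


m = FaultDetectionMatrix(4)
print(run_period(m, {0, 1, 3}))   # core 2 is silent this period
print(run_period(m, {0}))         # only core 0 survives
```

In this model every surviving core reaches the same verdict independently, so detection does not depend on a designated monitor core and still works when a single core remains.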

Description

Technical field

[0001] The invention relates to the field of operating system management methods for embedded system multi-core processors, in particular to a multi-core operating system fault-tolerant management method.

Background technique

[0002] Using multi-core processors in embedded systems is an effective way to improve system computing power and reliability. By dividing the computing task into multiple threads that can be executed in parallel and distributing them to run on different processor cores, the computing speed can be greatly improved. The reliability of the system can be improved by enabling multiple processor cores to execute the same task synchronously and output the operation results after voting.

[0003] During the operation of a multi-core system, when a processor core has a non-transient failure, the system needs to be able to sense the failure and reschedule the threads on the failed core to run on other processor cores. Among them, how to make t...


Application Information

IPC(8): G06F11/07
Inventor: 王婧彭飞龚健吴一帆华更新刘鸿瑾许娜徐建
Owner: BEIJING INST OF CONTROL ENG