
Peripheral component interconnect express non-transparent bridge (PCIe NTB)-based dual-controller storage high-availability (HA) subsystem

A dual-controller, non-transparent bridge technology, applied in the fields of response error generation, hardware redundancy for data error detection, and input/output to record carriers. It addresses the lack of I/O redirection in existing work and the resulting inability to form a complete dual-controller storage high-availability subsystem; it solves the problems of fault migration and fault recovery, ensures consistency, and avoids loss.

Active Publication Date: 2018-03-06
HANGZHOU EBOYLAMP ELECTRONICS CO LTD

AI Technical Summary

Problems solved by technology

[0005] At present, some work has implemented redundant heartbeat links based on PCIe non-transparent bridges alone, or cache mirroring based on PCIe non-transparent bridges alone, but none of it has addressed I/O redirection in the high-availability design of dual-controller storage. Such work cannot yet constitute a complete solution for dual-controller storage high-availability subsystems, and further research is urgently needed.



Examples


Embodiment 1

[0041] Embodiment 1: When the anomaly detection module detects that a service network port of the local end is faulty, it reports the abnormal event to the HA logic module.

[0042] After receiving the abnormal event, the HA logic module checks the status of each local service network port detected by the anomaly detection module and the status of the peer service network ports received by the anomaly detection module. When an available service network port exists at the local end, it issues the "Local IP Drift" command. After receiving this scheduling command, the I/O redirection module unbinds the IP of the faulty service network port through the ARP protocol and binds that IP to another available local service network port, chosen according to the set priority order, so that the service I/O is transferred to the newly bound service network port.
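The priority-ordered choice of a new local port described in [0042] can be illustrated with a minimal sketch. It is not the patent's implementation: the `ServicePort` type, the interface names, and the convention that a lower `priority` value means a more preferred port are all assumptions made for illustration, and the actual ARP rebinding is left out.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ServicePort:
    name: str       # hypothetical interface name, e.g. "eth1"
    priority: int   # assumed convention: lower value = more preferred
    healthy: bool   # status as reported by the anomaly detection module


def pick_drift_target(ports: List[ServicePort], faulty_name: str) -> Optional[ServicePort]:
    """Return the healthy local port with the best priority, or None if none is left."""
    candidates = [p for p in ports if p.healthy and p.name != faulty_name]
    return min(candidates, key=lambda p: p.priority, default=None)


if __name__ == "__main__":
    local_ports = [
        ServicePort("eth0", 0, False),  # the faulty port
        ServicePort("eth1", 2, True),
        ServicePort("eth2", 1, True),
    ]
    target = pick_drift_target(local_ports, "eth0")
    print(target.name if target else "no local port available")
```

A `None` result corresponds to the case where no local port is available, which the text handles separately.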

[0043] When the HA logic module has no available service network port at the local e...

Embodiment 2

[0044] Embodiment 2: If the anomaly detection module detects that all the service network ports of the local end are faulty, it reports the abnormal event to the HA logic module.

[0045] After receiving the abnormal event, the HA logic module checks the status of each local service network port detected by the anomaly detection module and the status of the peer service network ports received by the anomaly detection module, and issues the "peer IP drift" command. After the I/O redirection module receives the scheduling instruction, it releases the IP binding of the faulty local service network port through the ARP protocol and notifies the I/O redirection module of the opposite end to bind the IP to an available service network port of the opposite end. It then uses iptables address mapping to forward the service I/O back to the local end through the link corresponding to the PCIe NTB NIC interface to realize I/O r...
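The choice between the two drift commands in Embodiments 1 and 2 can be sketched as a small decision function. This is a hedged illustration only: the function name and the boolean-list representation of port status are hypothetical, and returning `None` when neither end has a usable port is an assumption, since the source does not describe that case.

```python
from typing import List, Optional


def choose_drift_command(local_ports_up: List[bool], peer_ports_up: List[bool]) -> Optional[str]:
    """Pick a scheduling command from the port states known to the HA logic module."""
    if any(local_ports_up):
        # Embodiment 1: at least one local service port is still usable.
        return "Local IP Drift"
    if any(peer_ports_up):
        # Embodiment 2: every local port is faulty, so the IP moves to the peer.
        return "peer IP drift"
    # Assumption: with no usable port on either end, no drift command is issued.
    return None


if __name__ == "__main__":
    print(choose_drift_command([False, True], [True, True]))
    print(choose_drift_command([False, False], [True, False]))
```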

Embodiment 3

[0049] Embodiment 3: If the anomaly detection module detects that the opposite end is not in place, it reports the abnormal event to the HA logic module. When one of the controllers is down, for example the first controller, the anomaly detection module of the second controller senses the corresponding abnormal event and reports it to the HA logic module, which issues the "take over the peer storage service" command. The second controller then takes over the storage resources under the first controller. The cache mirroring module switches to stand-alone working mode, converts the mirrored data of the first controller into cached data, and switches the underlying storage mode to write-through mode. The I/O redirection module then binds the IPs of the service network ports that originally belonged to the first controller to available service network ports of the local end through the ARP protocol; when the f...
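The ordering of the takeover steps in Embodiment 3 can be sketched as follows. This is an in-memory model under stated assumptions, not the patent's code: `ControllerState`, its field names, and the IP addresses are hypothetical; the pre-failure write policy is assumed to be write-back; and the cache and mirror are assumed to use non-overlapping block keys.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ControllerState:
    cache_mode: str = "mirrored"       # "mirrored" or "standalone"
    write_policy: str = "write-back"   # assumed policy before the failure
    cache: Dict[int, bytes] = field(default_factory=dict)        # local cached data
    peer_mirror: Dict[int, bytes] = field(default_factory=dict)  # mirror of the peer's cache
    bound_ips: List[str] = field(default_factory=list)           # service IPs served here


def take_over_peer(state: ControllerState, peer_ips: List[str]) -> ControllerState:
    """Apply the takeover steps in the order given in Embodiment 3."""
    # 1. The cache mirroring module switches to stand-alone working mode.
    state.cache_mode = "standalone"
    # 2. Mirrored data of the failed controller becomes ordinary cached data.
    state.cache.update(state.peer_mirror)
    state.peer_mirror.clear()
    # 3. The underlying storage mode switches to write-through.
    state.write_policy = "write-through"
    # 4. The failed controller's service IPs are bound locally (ARP step omitted).
    for ip in peer_ips:
        if ip not in state.bound_ips:
            state.bound_ips.append(ip)
    return state


if __name__ == "__main__":
    survivor = ControllerState(cache={1: b"a"}, peer_mirror={2: b"b"}, bound_ips=["10.0.0.1"])
    take_over_peer(survivor, ["10.0.0.2"])
    print(survivor.cache_mode, survivor.write_policy, survivor.bound_ips)
```

Switching to write-through after absorbing the mirror reflects that the surviving controller no longer has a partner holding a second copy of unflushed data.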


Abstract

The invention discloses a peripheral component interconnect express non-transparent bridge (PCIe NTB)-based dual-controller storage high-availability (HA) subsystem. The system includes an anomaly detection module, an I/O redirection module, a cache mirroring module and an HA logic module located on each of a first controller and a second controller. The anomaly detection module senses the in-place state of the opposite end through two heartbeat links. It is also used for detecting the state of each service network port of the local end, passing the state information of the service network ports between the two controllers, and reporting an anomaly event to the HA logic module when a service network port anomaly or an opposite-end in-place state anomaly is detected. The HA logic module receives the anomaly event reported by the anomaly detection module and issues a scheduling instruction. The I/O redirection module and the cache mirroring module realize takeover of the faulty controller according to the scheduling instruction. The system guarantees storage service continuity when one of the controllers crashes entirely, tolerates faults of components such as the heartbeat network and the service network ports, and solves fault migration and fault recovery problems at different granularities.
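Sensing the opposite end through two heartbeat links can be illustrated with a short sketch. The rule used here, that the peer counts as in place while at least one link has delivered a heartbeat within a timeout, is an assumption consistent with the stated tolerance of heartbeat network faults; the link names and the timeout value are hypothetical.

```python
from typing import Dict


def peer_in_place(last_seen: Dict[str, float], now: float, timeout: float) -> bool:
    """Return True while at least one heartbeat link is fresh (assumed rule)."""
    return any(now - t <= timeout for t in last_seen.values())


if __name__ == "__main__":
    # Timestamps in seconds of the last heartbeat received on each link.
    last_seen = {"link_a": 0.0, "link_b": 9.5}
    print(peer_in_place(last_seen, now=10.0, timeout=1.0))
```

Under this rule a single failed heartbeat link does not trigger a takeover; only silence on both links is reported as an in-place state anomaly.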

Description

Technical field

[0001] The invention belongs to the technical field of high-availability subsystems in dual-controller storage system design, and in particular relates to a dual-controller storage high-availability subsystem based on a PCIe non-transparent bridge.

Background technique

[0002] With the development of storage array technology, the functions and characteristics of storage products are increasing day by day, and the I/O performance is also continuously improved. However, although there are various designs and implementations of dual-controller storage arrays, none of them have implemented a high-availability subsystem that takes both functions and performance into account.

[0003] In the high-availability design of the dual-controller storage system, there are three essential key components: an abnormality detection module that can sense abnormal events, a network high-availability module that can ensure uninterrupted I/O services, and data consistency The ca...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F11/20, G06F11/07, G06F3/06
Inventor: 范长军, 周杰, 朱敏杰, 陈康润, 李昀
Owner: HANGZHOU EBOYLAMP ELECTRONICS CO LTD