
A multi-disk load balancing asynchronous read and write scheduling method and device

A load-balancing, asynchronous read and write technology applied in the database field to improve data read and write efficiency and reduce performance overhead.

Active Publication Date: 2022-02-18
北京东方国信科技股份有限公司
2 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0003] At present, in an HDFS system, the data nodes store the actual data on multiple disks, including ordinary mechanical disks and solid-state disks (SSDs). While a database is running, its temporary data also needs to be saved on these data-node disks. Most databases read and write the data stored in HDFS directly through the HDFS interface, and read and write local temporary files through the local file system interface. Under multiple concurrent requests, this results in synchronous reads and writes across multiple disks of different types, with the reads and writes of the different disk types mixed together. The outcome is performance overhead caused by contention among multiple threads and among multiple disks under high concurrency.
At this stage, there is no technical solution for load-balanced asynchronous reads and writes across multiple disks in an HDFS system.


Examples


Detailed Description of the Embodiments

[0037] The implementation of the present invention is illustrated below by specific examples, and those familiar with this technology can readily understand other advantages and effects of the present invention from the contents disclosed in this description. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative effort fall within the protection scope of the present invention.

[0038] Referring to Figure 1, this embodiment provides a multi-disk load balancing asynchronous read and write scheduling method, including the following steps:

[0039] S1: Configure HDFS so that the data nodes of the HDFS system report the disk information to the name node, and start the metadata query service process on the name node. The disk information includes the disk device number and disk type...
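
The disk information named in S1 (device number and disk type) and the numbering of disks by type can be pictured with a minimal sketch. Everything here is an assumption made for illustration: the `DiskInfo` record, the `number_disks` helper, the "major:minor" device-number format, and the choice to number SSDs before mechanical disks are all hypothetical and are not taken from the patent text.

```python
from dataclasses import dataclass
from enum import Enum


class DiskType(Enum):
    HDD = "hdd"  # ordinary mechanical disk
    SSD = "ssd"  # solid-state disk


@dataclass(frozen=True)
class DiskInfo:
    """Hypothetical record a data node might report to the name node."""
    device_number: str  # assumed "major:minor" string; the real format is not given
    disk_type: DiskType


def number_disks(disks):
    """Assign consecutive disk numbers, keeping disks of one type together.

    SSDs are numbered first, then HDDs. This ordering is an arbitrary choice
    for the sketch; the source only says numbering follows the disk type.
    """
    type_order = {DiskType.SSD: 0, DiskType.HDD: 1}
    ordered = sorted(disks, key=lambda d: (type_order[d.disk_type], d.device_number))
    return {number: disk for number, disk in enumerate(ordered)}


reported = [
    DiskInfo("8:16", DiskType.HDD),
    DiskInfo("259:0", DiskType.SSD),
    DiskInfo("8:0", DiskType.HDD),
]
numbered = number_disks(reported)
for number, disk in numbered.items():
    print(number, disk.disk_type.value, disk.device_number)
```

Grouping by type before numbering means a disk number alone tells the scheduler which class of device it is addressing, which is one plausible reason to number disks this way.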


Abstract

A multi-disk load balancing asynchronous read and write scheduling method and device. HDFS is configured so that the data nodes of the HDFS system report disk information to the name node, and a metadata query service process is started on the name node. All local disk information is obtained, and the disks are numbered according to the disk type in the disk information. An independent read-write worker thread is created for each disk, and each thread waits for the upper layer to send read and write tasks. The metadata query process receives requests for the HDFS data that upper-layer users want to read and obtains the disk number of that data through the metadata query service process on the name node. When a read-write task is sent to a read-write worker thread, the disk number is sent with it; on receiving the task, the worker thread processes each read-write task asynchronously according to the disk information and notifies the upper-layer user after completion. This reduces the performance overhead caused by contention among multiple threads and multiple disks under high concurrency, and improves data read and write efficiency.
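
The scheduling idea in the abstract (one worker thread per disk, tasks tagged with a disk number, asynchronous processing, and a completion notification) can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the `DiskScheduler` class, its `submit` and `shutdown` methods, and the `notify` callback are hypothetical names, and plain callables stand in for real disk reads and writes.

```python
import queue
import threading


class DiskScheduler:
    """Hypothetical sketch: one task queue and one worker thread per disk number."""

    def __init__(self, disk_numbers):
        self._queues = {n: queue.Queue() for n in disk_numbers}
        self._threads = []
        for n, q in self._queues.items():
            t = threading.Thread(target=self._worker, args=(n, q), daemon=True)
            t.start()
            self._threads.append(t)

    def _worker(self, disk_number, tasks):
        # Each worker only ever touches its own disk, so requests for
        # different disks never wait on each other.
        while True:
            item = tasks.get()
            if item is None:  # sentinel: stop this worker
                break
            task, on_done = item
            try:
                result, error = task(), None
            except Exception as exc:
                result, error = None, exc
            on_done(disk_number, result, error)  # notify the upper layer

    def submit(self, disk_number, task, on_done):
        """Queue a task for the given disk and return immediately."""
        self._queues[disk_number].put((task, on_done))

    def shutdown(self):
        """Let every worker drain its queue, then stop it."""
        for q in self._queues.values():
            q.put(None)
        for t in self._threads:
            t.join()


results = []
results_lock = threading.Lock()


def notify(disk_number, result, error):
    # Stand-in for notifying the upper-layer user on completion.
    with results_lock:
        results.append((disk_number, result, error))


scheduler = DiskScheduler([0, 1, 2])
scheduler.submit(0, lambda: "read block A", notify)
scheduler.submit(2, lambda: "write temp file", notify)
scheduler.submit(0, lambda: "read block B", notify)
scheduler.shutdown()
```

Because every disk has exactly one worker, tasks for the same disk run in submission order, while tasks for different disks proceed in parallel. The caller never blocks on the I/O itself and only learns of the outcome through the callback.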

Description

Technical field

[0001] The embodiments of the present invention relate to the technical field of databases, in particular to a multi-disk load balancing asynchronous read and write scheduling method and device.

Background

[0002] HDFS (Hadoop Distributed File System) is a distributed file system designed to run on general-purpose hardware. It is highly fault-tolerant, suitable for deployment on inexpensive machines, provides high-throughput data access, and is well suited to applications with large data sets. HDFS has a master-slave architecture. An HDFS cluster consists of one name node, a master server that manages the file system namespace and regulates client access to files, and a number of data nodes, usually one per machine, which manage the storage attached to their nodes. HDFS exposes the file system namespace and allows user data to be stored in the ...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F3/06; G06F16/22
Inventor: 董隆超, 陈兴利, 张娇娇
Owner: 北京东方国信科技股份有限公司