Unlock instant, AI-driven research and patent intelligence for your innovation.

Scheduling system and method for equalizing memory access latency among multiple threads under NUMA architecture

A multi-threaded, balanced technology, applied in the field of scheduling systems that balances the memory access delay between multiple threads, can solve problems such as slow running speed, large remote memory accesses, and dragging down the running speed of the program, and achieve the effect of reasonable adjustment granularity

Active Publication Date: 2016-06-22
HUAZHONG UNIV OF SCI & TECH
View PDF2 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] For multi-threaded programs that have thread synchronization operations during operation, the running speed of each thread needs to be considered when running under the NUMA architecture. If there are some threads among the threads that need to be synchronized, due to the large number of remote memory accesses executed, If the running speed is slow, then this thread becomes the key thread that drags down the running speed of the program. At this time, in order to reduce the work done by other threads for remote memory access, it will not improve the overall performance of the final program very well.
Among the existing optimization tools for program running performance under the NUMA architecture, there is a lack of optimization methods for the problem of balancing memory access delays between multiple threads

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scheduling system and method for equalizing memory access latency among multiple threads under NUMA architecture
  • Scheduling system and method for equalizing memory access latency among multiple threads under NUMA architecture

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0047] Such as figure 1 As shown, the present invention is a scheduling system that balances the memory access delay between multiple threads under the NUMA architecture, and the system includes a detection module, a sampling module, an analysis module, a judgment module and a scheduling module, wherein,

[0048] The detection module is used to detect whether the program enters the multi-thread parallel execution area, and is also used to start the sampling module after the detection program enters the multi-thread parallel execution area;

[0049] The sampling module is used to sample the memory ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a scheduling system and method for equalizing memory access latency among multiple threads under a NUMA architecture. The system comprises a detecting module, a sampling module, an analyzing module, a judging module and a scheduling module. Through sampling and storing memory access information of each thread during the operating process of a multi-thread program, predicting and analyzing whether the memory access latency of the threads in the multi-thread program is out of balance or not, and scheduling reasonably according to the analysis result, performing migration scheduling on thread access variables of far-end memory access to nodes where threads are placed or performing interleaved storage to equally distribute the thread access variables to all nodes, so that memory access latency of all the threads is ensured to be basically equal. According to the scheduling system and method, by way of equalizing memory access latency among multiple threads, the operating performance of the multi-thread program under the NUMA architecture can be optimized; through a fine grit, the real-time scheduling is performed in a targeted way so that the performance optimization of the multi-thread program in a parallel region is achieved.

Description

technical field [0001] The invention belongs to the field of multi-thread performance optimization under computer architecture, and more specifically, relates to a scheduling system and method for balancing memory access delay among multi-threads under NUMA architecture. Background technique [0002] Non-uniform memory access (NUMA) architecture is one of the popular commercial server architectures. It adopts a distributed memory model, and the processors of all nodes can access all physical memory, which is easy to manage and has good scalability. Therefore, Has been widely used. [0003] In the NUMA architecture, the memory accessed by each CPU can be divided into two types: the memory on the same node as the CPU is called local memory, and the access delay is very low; the memory on a different node from the CPU is called remote memory. To access the end memory, the CPU needs to be connected through nodes, so its access delay is longer than that of the local memory. Thi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/48
CPCG06F9/4881
Inventor 金海廖小飞朱亮曾丹
Owner HUAZHONG UNIV OF SCI & TECH