Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Implementation method of pipeline pre-branching

A branch prediction and program technology, which is applied in program control design, instrumentation, electrical digital data processing, etc., can solve problems such as single function, uncertain pipeline hardware resource branch prediction, and high cost

Pending Publication Date: 2021-02-26
王志平
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] However, under the existing background technical conditions, whether the pipeline hardware of the two "threads" existing in the kernel is actually used by the "threads" depends on the scheduling of the operating system, that is, the program itself cannot be sure that all pipeline hardware resources are available. Can be used for branch prediction
Therefore, under the current background technical conditions, unless the software, especially the system software, can do more "coordination" or "synchronization", theoretically it can be realized that the program can be sure whether all pipeline hardware resources can be used for branch prediction, But in this way, it does not have the meaning of actually improving efficiency (so, in fact, under the existing background technology conditions, the high-end processing core does not realize more threads than 2 threads, that is, pipeline hardware resources, precisely because of the realization of More "threads" whose functions are too single
Single-threaded multi-branch applications, which account for the majority of applications, will cause excessive cost and waste of power consumption due to excessive pipeline hardware resources being in a useless state)
In addition, even if the existing background technology realizes the above-mentioned theoretical possibility, the existing background technology still cannot solve the problem of ultra-long pipeline branch prediction well, because in practical applications, within the execution time of a single instruction (instruction from entering pipeline to generate output), there will be a high probability that there will be multiple binary tree branch points in the pipeline, so before the result of the conditional jump instruction, after the second binary tree branch point enters the pipeline, it can still only be 50% correct sex probability assumptions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Implementation method of pipeline pre-branching
  • Implementation method of pipeline pre-branching

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0035] One embodiment of the present invention, kernel enters " binary tree mode ", carries out ABTP processing to first BTBP, comprises steps:

[0036] Step 1. The program executes the PMR instruction, all PLs enter the "binary tree mode", and the PRC enters the "binary tree mode", such as figure 1 shown, except PL 0 other than PL 1 、PL 2 , and PL 3 Both enter PL off status, line B reflects the information that there is available PL off , and reflect the available PL off The PLI is 1;

[0037] Step 2, such as figure 2 As shown, BTP encounters the first BTBP, hereinafter abbreviated as BTBP 0 , the BTB F The instruction address information and related status information of the instruction are transmitted to the PL 1 ,PL 1 Start executing BTB F (later the BTB F Abbreviated as BTB 1 ), PL 0 execute BTB T (later the BTB T Abbreviated as BTB 0 ). At the same time, PRC updates the information record about PL, and line B reflects the information that PL is availa...

Embodiment 2

[0039] An embodiment of the present invention, on the basis of embodiment 1, PL 0 BTBP not yet drawn 0 Judgment result, PL 1 Encounter a BTBP (hereafter abbreviated as BTBP 1 ), PL 1 Perform ABTP processing, including steps:

[0040] Step 1, PL 1 Change the current PL 1 BTB on F The instruction address information and related status information of the instruction are transmitted to the PL 2,PL 2 Start executing BTB F (later the BTB F Abbreviated as BTB 2 ), PL 1 execute BTB T (at this time the BTB T BTB 1 continuation of). At the same time, PRC updates the information record about PL, and line B reflects the information that PL is available off , and reflect the available PL off The PLI is 3.

Embodiment 3

[0042] An embodiment of the present invention, on the basis of embodiment 2, PL 0 BTBP not yet drawn 0 Judgment result, PL 1 BTBP not yet drawn 1 Judgment result, PL 0 Encounter a BTBP (hereafter abbreviated as BTBP 2 ), PL 0 Perform ABTP processing, including steps:

[0043] Step 1, PL 0 Change the current PL 0 BTB on F The instruction address information and related status information of the instruction are transmitted to the PL 3 ,PL 3 Start executing BTB F (later the BTB F Abbreviated as BTB 3 ), PL 0 execute BTB T (at this time the BTB T BTB 0 continuation of). At the same time, PRC updates the information record about PL, line B reflects the information that there is no available PL off .

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an implementation method of pipeline pre-branching. According to the implementation method of pipeline pre-branching, for a processor core, the complexity of a pipeline usually determines the performance of a core instruction set architecture, and even determines a market area suitable for a processor. In fact, the implementation complexity of assembly line hardware is notdecided by an instruction set, but aiming at different market areas, kernel assembly lines with different complexity degrees are needed, and then a corresponding instruction set architecture is provided. Therefore, the viewpoint that any one of the RISC and the CISC can replace the other one is unscientific unless the related application market disappears. The circuit structure of the short assembly line is easier to obtain the low-end processor market, and the complex ultra-long assembly line is more suitable for the high-end processor market. However, the hardware structure of the ultra-long assembly line can bring a very troublesome pre-branching problem, and the technical scheme of the invention can better solve the pre-branching problem for the structure of the ultra-long assembly line, so that the structure of the ultra-long assembly line has higher efficiency.

Description

technical field [0001] The invention relates to the field of integrated circuits and computers, in particular to a method for realizing pipeline pre-branching. Background technique [0002] Under the current background technical conditions, there are two different processor instruction set architectures, namely RISC and CISC (that is, "reduced instruction set" and "complex instruction"). However, in fact, the real difference is the hardware implementation of the instruction sets of the two camps, that is, the circuit implementation of the pipeline. [0003] The hardware corresponding to RISC implements the basic operations required by programs such as operations, instruction / data reading, and data writing based on more "programming registers". All operations of RISC are completed based on "programming registers", so the goal of RISC is actually to make the circuit structure simpler, so the core pipeline structure adopted by the RISC camp is a simple pipeline structure, that...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/38
CPCG06F9/3867
Inventor 王志平
Owner 王志平
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products