Loading/memorizing unit of embedded processor

A technology of embedded processors and storage units, applied in memory systems, electrical digital data processing, instruments, etc., can solve problems such as blocking normal execution, and achieve the effect of improving performance

Active Publication Date: 2009-09-09
C SKY MICROSYST CO LTD
View PDF0 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the existence of cache access misses, once access misses occur, data loading / storage needs to be loaded / stored from the off-chip memory through an external bus. This process requires several or even dozens of instruction cycles.
Since the current mainstream embedded high-performance processors generally adopt a pipeline structure, the pause of the previous instruction will block the normal execution of the entire pipeline.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Loading/memorizing unit of embedded processor
  • Loading/memorizing unit of embedded processor

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The present invention will be further described below in conjunction with the accompanying drawings.

[0020] refer to figure 1 with figure 2 , a load / store unit of an embedded processor. In the load / store unit, the read and write instructions in a fully pipelined manner are used to access the on-chip cache. Instructions are saved to a dedicated access miss buffer and pipeline resources are released, and subsequent instructions continue to access the cache.

[0021] An independent off-chip update buffer is set to cache the write operation of the off-chip memory to quickly retire the off-chip memory write operation; an independent on-chip update buffer is set to cache the on-chip cache update operation to quickly retire the cache write operation.

[0022] This embodiment adopts the full pipeline method to realize the access of the read and write instructions to the on-chip cache: the full pipeline method to realize the access of the read and write instructions to the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a loading/memorizing unit of an embedded processor; in the loading/memorizing unit, an entire pipeline moe is adopted to read and write command to visit high-speed cache on the disc; if the visit absence occurs in a preorder command, the command resulting in the visit absence is memorized in a special visit absence buffer, the pipeline resource is released and the sequent command continues to visit the high-speed cache; an independent update buffer outside the disc is arranged to cache the write operation of a memory outside the disc to cause the fast retirement of the write operation of the memory outside the disc; an independent update buffer on the disc is arranged to cache the high-speed cache update operation on the disc to cause the fast retirement of the high-speed cache write operation. When the high-speed cache visit absence occurs, the invention can not block the pipeline, thus improving the performance of the processor.

Description

technical field [0001] The invention relates to a load / store unit of an embedded processor. Background technique [0002] In recent years, with the development of high-performance embedded processors, in order to improve the utilization of internal logic components and improve the performance of the entire processor, various methods such as multi-instruction launch and out-of-order execution are usually adopted. Some superscalar processors that are commonly used today can usually execute multiple instructions concurrently in one instruction cycle. After the processor prefetches a batch of instructions from the instruction cache, it will analyze those instructions that can be executed in parallel without associativity, and then send them to several independent execution units for parallel execution. [0003] Generally speaking, load / store instructions (executed by the load / store unit) require multiple instruction cycles, while general arithmetic instructions are usually exec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/38G06F12/08G06F12/0831G06F12/0842
Inventor 严晓浪丁永林葛海通孟建熠
Owner C SKY MICROSYST CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products