Method, equipment and medium for realizing data arrangement

A data column and data technology, applied in the field of data arrangement and computer-readable storage media, can solve the problems of wasting data bandwidth of instruction memory, long delay of read and write operations of shared memory, and increase of instruction read delay.

Pending Publication Date: 2022-05-27
SHANGHAI BIREN TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method increases the read delay of the instruction and wastes the data bandwidth of the instruction memory.
In addition, the index calculation of the shared memory is more complex and needs to be converted into additional calculations for the corresponding addresses
[0004] To sum up,...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, equipment and medium for realizing data arrangement
  • Method, equipment and medium for realizing data arrangement
  • Method, equipment and medium for realizing data arrangement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, including various details of the present disclosure embodiments to facilitate understanding, and they should be considered merely exemplary. Accordingly, those of ordinary skill in the art should recognize that various changes and modifications may be made to the embodiments described herein, without departing from the scope and spirit of the present disclosure. Similarly, for the sake of clarity and brevity, the description of the well-known functions and structures is omitted from the following description.

[0027] As used herein, the term "comprising" and its variants indicate openness including, i.e., "including but not limited to". Unless otherwise stated, the terms "or" means "and / or". The term "based on" means "at least partially based on". The term "an exemplary embodiment" and "an embodiment" means "at least one exemplary embodiment". The term "...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention relates to a method and device for achieving data arrangement and a medium, and the method comprises the steps that first auxiliary data and second auxiliary data are obtained based on the thread count of a thread group and the data arrangement requirement; shifting the data in the first register, the first auxiliary data, the data in the second register and the second auxiliary data to a transfer register; extracting the data in the transfer register based on the address of the transfer register so as to obtain first data, second data, third data and fourth data; determining a mapping parameter based on the thread count of the thread group and the data arrangement requirement; and based on the mapping parameter, mapping the third data to the first register and mapping the fourth data to the second register, thereby realizing data arrangement in the registers. Therefore, data arrangement among a plurality of registers can be realized more quickly under the condition that the shared memory address is not calculated.

Description

Technical field [0001] Embodiments of the present disclosure relate in general to the processor field, and more particularly relate to a method for implementing data arrangement, computing devices and computer-readable storage media. Background [0002] In general, in general-purpose computing on graphicsprocessing units (GPGPUs), fast Fourier (FFT) operations with inputs as real numbers and outputs as complex numbers (R2C) are usually involved. The operation requires a sequence of real numbers of the original input (for example, parity rearrangement) to be rearranged into a new sequence of complex numbers. This involves arranging data between groups of threads (warps) between different registers. Permutation operations rely on data interaction between thread groups. [0003] The way data interacts between existing thread groups is primarily through shared storage. The system calculates the corresponding shared memory address according to the index of different thread data, and t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F7/24G06F9/30
CPCG06F7/24G06F9/3013
Inventor 不公告发明人
Owner SHANGHAI BIREN TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products