Variable format, variable sparsity matrix multiplication instruction

A sparse-matrix and matrix multiplication technology, applied in the field of computer processor architecture, that addresses problems such as the lack of flexibility in traditional matrix multiplication methods.

Pending Publication Date: 2019-12-17

INTEL CORP

0 Cites, 4 Cited by

Problems solved by technology

Some traditional matrix multiplication methods are specialized: they lack the flexibility to support various data formats (signed and unsigned 8-bit / 16-bit integers, 16-bit floating point) with wide accumulators, and the flexibility to support both dense and sparse matrices.

Embodiment Construction

[0036] In the following description, numerous specific details are set forth. However, it is understood that some embodiments may be practiced without these specific details. In other instances, well-known circuits, structures and techniques have not been shown in detail in order not to obscure the understanding of this description.

[0037] References in the specification to "one embodiment," "an embodiment," "exemplary embodiment," etc. indicate that the described embodiment may include a feature, structure, or characteristic, but that each embodiment may not necessarily include the feature, structure, or characteristic. Moreover, these phrases are not necessarily referring to the same embodiment. Furthermore, when a feature, structure, or characteristic is described with respect to an embodiment, it is considered to be within the knowledge of those skilled in the art to affect such a feature, structure, or characteristic with respect to other em...

Abstract

Disclosed embodiments relate to a variable format, variable sparsity matrix multiplication (VFVSMM) instruction. In one example, a processor includes fetch and decode circuitry to fetch and decode a VFVSMM instruction specifying locations of A, B, and C matrices having (M x K), (K x N), and (M x N) elements, respectively, and execution circuitry, responsive to the decoded VFVSMM instruction, to: route each row of the specified A matrix, staggering subsequent rows, into corresponding rows of an (M x N) processing array, and route each column of the specified B matrix, staggering subsequent columns, into corresponding columns of the processing array, wherein each processing unit of the array is to generate K products of A-matrix elements and matching B-matrix elements, a matching B-matrix element being one whose row address is the same as the column address of the A-matrix element, and to accumulate each generated product with a corresponding C-matrix element.
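The dataflow described in the abstract can be illustrated in software. The following is a minimal Python sketch, not the patented circuitry: the function names `vfvsmm_reference` and `sparse_unit_accumulate`, the cycle-stepping loop, and the (address, value) stream representation are all assumptions made for illustration. The first function models the staggered routing (row i of A delayed by i cycles, column j of B delayed by j cycles), so that element k of both streams meets at processing unit (i, j) on cycle i + j + k. The second models one processing unit in a sparse setting, multiplying only where the A element's column address equals the B element's row address.

```python
def vfvsmm_reference(a, b, c):
    """Return C + A x B by stepping a modeled (M x N) array cycle by cycle.

    Hypothetical software model: a is M x K, b is K x N, c is M x N,
    all given as lists of lists.
    """
    m, k_dim, n = len(a), len(b), len(b[0])
    assert all(len(row) == k_dim for row in a), "A must be M x K"
    assert all(len(row) == n for row in b), "B must be K x N"
    assert len(c) == m and all(len(row) == n for row in c), "C must be M x N"

    out = [row[:] for row in c]  # accumulators start from the C matrix
    # Row i of A enters i cycles late and column j of B enters j cycles late,
    # so A[i][k] and B[k][j] both reach unit (i, j) on cycle i + j + k.
    last_cycle = (m - 1) + (n - 1) + (k_dim - 1)
    for t in range(last_cycle + 1):
        for i in range(m):
            for j in range(n):
                k = t - i - j
                if 0 <= k < k_dim:
                    out[i][j] += a[i][k] * b[k][j]
    return out


def sparse_unit_accumulate(a_stream, b_stream, c_val):
    """Model one processing unit fed only nonzero elements.

    a_stream: (column_address, value) pairs from one row of A.
    b_stream: (row_address, value) pairs from one column of B.
    A product is generated only when the two addresses match.
    """
    b_by_row = dict(b_stream)
    for col_addr, a_val in a_stream:
        if col_addr in b_by_row:
            c_val += a_val * b_by_row[col_addr]
    return c_val


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    c = [[1, 0], [0, 1]]
    print(vfvsmm_reference(a, b, c))
    print(sparse_unit_accumulate([(0, 2), (3, 5)], [(3, 4), (1, 7)], 1))
```

In this sketch each unit performs at most one multiply-accumulate per cycle, and the whole array finishes after (M - 1) + (N - 1) + K cycles. How the actual hardware buffers, matches, and times sparse elements is not stated in the abstract and is not modeled here.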

Description

Technical Field

[0001] The field of the invention relates generally to computer processor architecture and, in particular, to variable format, variable sparsity matrix multiplication instructions.

Background

[0002] Machine learning architectures such as deep neural networks have been applied in domains including computer vision, speech recognition, natural language processing, audio recognition, social network filtering, machine translation, bioinformatics, and drug design. Deep learning is a class of machine learning algorithms. Maximizing the flexibility and cost-efficiency of deep learning algorithms and computations can help meet the needs of deep learning processors, such as those performing deep learning in data centers.

[0003] Matrix multiplication is a key performance / power limiter for many algorithms, including machine learning. Some traditional matrix multiplication methods are specialized, such as they lack the flexibility to support various data fo...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G06F9/30, G06F7/523, G06F17/16

CPC: G06F9/30036, G06F9/30145, G06F7/523, G06F17/16, G06F9/30038, G06N3/063, G06F9/3001, G06F9/3016, G06N20/00

Inventor: 马克·A·安德斯, 希曼殊·考尔, 萨努·马修

Owner: INTEL CORP
