Variable format, variable sparsity matrix multiplication instruction

A sparse matrix and matrix technology, applied in the field of computer processor architecture, can solve problems such as lack of flexibility
CN110580175APending Publication Date: 2019-12-17INTEL CORP

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
INTEL CORP
Publication Date
2019-12-17

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

Disclosed embodiments relate to a variable format, variable sparsity matrix multiplication (VFVSMM) instruction. In one example, a processor includes fetch and decode circuitry to fetch and decode a VFVSMM instruction specifying locations of A, B, and C matrices having (M x K), (K x N), and (M x N) elements, respectively, execution circuitry, responsive to the decoded VFVSMM instruction, to: routeeach row of the specified A matrix, staggering subsequent rows, into corresponding rows of a (M x N) processing array, and route each column of the specified B matrix, staggering subsequent columns,into corresponding columns of the processing array, wherein each of the processing units is to generate K products of A-matrix elements and matching B-matrix elements having a same row address as a column address of the A-matrix element, and to accumulate each generated product with a corresponding C-matrix element.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The field of the invention relates generally to computer processor architecture and, in particular, to variable format, variable sparse matrix multiply instructions. Background technique

[0002] Machine learning architectures such as deep neural networks have been applied in domains including computer vision, speech recognition, natural language processing, audio recognition, social network filtering, machine translation, bioinformatics, and drug design. Deep learning is a class of machine learning algorithms. Maximizing the flexibility and cost-efficiency of deep learning algorithms and computations can help meet the needs of deep learning processors, such as those performing deep learning in data centers.

[0003] Matrix multiplication is a critical performance / power limitation of many algorithms, including machine learning. Some traditional matrix multiplication methods are specialized, such as they lack the flexibility to support various data fo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More