Array multiple with reduced bandwidth requirement
A technique for operands and program instructions, which is applied in the field of reducing memory bandwidth and can solve problems such as limiting the overall computing performance of matrix multiplication
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0013] In the following description, numerous specific details are set forth to provide a more thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced without one or more of these specific details. In other instances, well-known features have not been described in order to avoid obscuring the present invention.
[0014] 1A illustrates a conceptual diagram of matrix A 101 and matrix B 102 that are multiplied to produce matrix C 103 in accordance with one or more aspects of the disclosure. Conventionally, a dot product is computed using the elements in the rows of matrix A 101 and the columns of matrix B 102 to produce the elements in the columns of matrix C 103 . For example, elements in row 107 of matrix A 101 and elements in column 105 of matrix B 102 (eg, 131 , 132 and 146 ) are used to generate element 152 in column 104 of matrix C 103 . When multiple threads of execution are used...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 