The invention discloses a large-scale multi-operation floating point matrix calculation acceleration implementation method, which comprises the following steps: S1, receiving an external input signal and judging a matrix operation mode according to an operation type of a to-be-processed matrix: when the operation mode is matrix addition and matrix subtraction, turning to execute a step S3, and when the operation mode is matrix subtraction, turning to execute a step S4; when the operation mode is matrix multiplication, matrix-vector multiplication and matrix-scalar multiplication, turning to execute the step S2; s2, initializing an on-chip RAM (Random Access Memory) to be zero, and turning to execute a step S4; s3, the data source C is loaded into the on-chip RAM through the RAM channel, and the step S4 is executed; s4, pre-loading a part of the data stream A through an RAM channel, and loading the data stream A and the data stream B while calculating; s5, after calculation is completed, a calculation result is transmitted to the off-chip memory. The device is used for implementing the method. The method has the advantages of low storage requirement, high calculation efficiency, high reusability, wide application range and the like.