
Pre-trained model inference processing method and device, electronic device and storage medium

A processing method for pre-trained models, applied in inference methods, character and pattern recognition, instruments, etc., that addresses the high cost of inference and achieves high processing speed.

Active Publication Date: 2022-06-28
Beijing Academy of Artificial Intelligence (北京智源人工智能研究院)
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a pre-trained model inference processing method, device, electronic device and storage medium that address the high cost of inference with large-scale pre-trained models in the prior art, achieving low cost and high processing speed during inference.



Examples


Embodiment Construction

[0027] To make the objectives, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0028] In recent years, large-scale pre-trained models have become a research hotspot; for example, large-scale pre-trained language models are a focus of research in natural language processing. Here, a large-scale pre-trained model refers to a model with more than one billion parameters.
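To give a rough sense of why parameter count drives inference cost, the sketch below computes the storage needed for the parameters alone of a model at the one-billion-parameter threshold. The 32-bit and 8-bit widths and the name `param_memory_gib` are illustrative assumptions, not values or names taken from the patent text.

```python
def param_memory_gib(num_params: int, bits_per_param: int) -> float:
    """Storage for the parameters alone, in GiB (activations and overhead ignored)."""
    return num_params * bits_per_param / 8 / 2**30


# One billion parameters is the threshold the text uses for "large-scale".
NUM_PARAMS = 1_000_000_000

# Assumed bit widths, chosen only for illustration.
fp32_gib = param_memory_gib(NUM_PARAMS, 32)
int8_gib = param_memory_gib(NUM_PARAMS, 8)

print(f"32-bit parameters: {fp32_gib:.2f} GiB")
print(f" 8-bit parameters: {int8_gib:.2f} GiB")
```

Under these assumptions the parameter storage shrinks by the ratio of the bit widths, a factor of four.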

[0029] For the convenience of description, in this application, the large-scale pre-trainin...



Abstract

The invention provides a pre-trained model inference processing method and device, an electronic device and a storage medium. The method is applied to a server that performs inference processing with a model to be processed, and comprises the following steps. First, the model to be processed is determined: it is obtained through pre-training and is represented with high-bit floating-point numbers, where the bit width of a high-bit floating-point number is greater than or equal to a first bit-width threshold. Second, based on model quantization, the parameters of the model to be processed are converted from the high-bit floating-point representation to a low-bit fixed-point representation in order to accelerate inference, where the bit width of a low-bit fixed-point number is less than or equal to a second bit-width threshold. The method achieves low cost and high processing speed when performing inference with a large-scale model.
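The conversion from a floating-point to a low-bit fixed-point representation can be sketched as follows. This is a minimal illustration that assumes symmetric quantization with a single shared scale and an 8-bit target; the abstract does not specify the quantization scheme or the threshold values, and the names `quantize_symmetric` and `dequantize` are hypothetical.

```python
from typing import List, Tuple


def quantize_symmetric(weights: List[float], num_bits: int = 8) -> Tuple[List[int], float]:
    """Map floats to signed integers in [-qmax, qmax] using one shared scale.

    Assumed scheme (not from the patent text): symmetric, per-tensor scaling.
    """
    qmax = 2 ** (num_bits - 1) - 1                      # 127 for 8 bits
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0      # avoid division by zero
    quantized = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized: List[int], scale: float) -> List[float]:
    """Recover approximate floating-point values from the integers and the scale."""
    return [q * scale for q in quantized]


# Toy parameter values, chosen only for illustration.
weights = [0.5, -1.27, 0.25, 0.0]
q, scale = quantize_symmetric(weights, num_bits=8)
restored = dequantize(q, scale)

print(q)         # integer codes stored in place of the floats
print(restored)  # approximation used at inference time
```

Each restored value differs from the original by at most half the scale, which is the accuracy traded for the smaller representation.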

Description

Technical field

[0001] The present invention relates to the field of model processing, in particular to a pre-trained model inference processing method, device, electronic device and storage medium.

Background

[0002] In recent years, large-scale pre-trained models have become a research hotspot; for example, large-scale pre-trained language models are a focus of research in natural language processing. Advances in pre-trained language model technology have made it possible to train large-scale models with tens or even hundreds of billions of parameters (such as OpenAI GPT-3 and BAAI's WuDao 2.0). These large-scale models have achieved remarkable results on many natural language processing tasks and have attracted sustained attention from researchers.

[0003] Although large-scale pre-trained language models perform remarkably on multiple tasks, their large parameter size also brings great challenges t...


Application Information

IPC(8): G06K9/62, G06N5/04
CPC: G06N5/04, G06F18/214, Y02D10/00
Inventor: Jia Chao (贾超), Zheng Zhi (郑直)
Owner: Beijing Academy of Artificial Intelligence (北京智源人工智能研究院)