Training method and device of knowledge pre-training model and electronic equipment

A training method and a technology of a training device, which are applied in the training of knowledge pre-training models and in the field of computer program products, can solve problems such as model output errors, models that do not have common sense reasoning capabilities, and joint training of common sense learning and semantic learning that cannot be realized.

Active Publication Date: 2021-03-16
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF7 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002]Currently, most of the models are not capable of commonsense reasoning, for example, if the question is "with what can use ink to make a copy of a document on paper", the answer includes pen, photocopier , carbon paper, notebook, people can correctly choose the answer of the copier based on common sense, however, due to the high frequency of co-occurrence of carbon paper and the carbon and paper in the question, the model is likely to choose the answer of the carbon paper, resulting in the wrong result of the model output
The model training method in the related art cannot realize the joint training of common sense learning and semantic learning, and the model gain is limited by the quality of the sample, and often needs to retrain the model, which is less flexible

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Training method and device of knowledge pre-training model and electronic equipment
  • Training method and device of knowledge pre-training model and electronic equipment
  • Training method and device of knowledge pre-training model and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0018] Speech can include technical fields such as speech recognition and speech interaction, and is an important direction in the field of artificial intelligence.

[0019] Voice Recognition (Voice Recognition) is a technology that allows machines to convert voice signals into corresponding text or commands through the process of recognition and understanding. It mainl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a training method and device of a knowledge pre-training model and electronic equipment, and relates to the technical field of voice, natural language processing and deep learning. According to the specific implementation scheme, a training text is acquired, the training text comprises a structured knowledge text and a corresponding article, and the structured knowledge text comprises a head node, a tail node and a relationship between the head node and the tail node; and the knowledge pre-training model is trained to be trained according to the training text. Accordingto the method, the knowledge pre-training model to be trained can learn common knowledge and rich semantic knowledge at the same time, joint training of the common knowledge and the semantic knowledge can be achieved, and a training entity does not need to be embedded into the knowledge pre-training model to be trained; the performance gain of the knowledge pre-training model is not limited by the embedding quality of the training entity, the knowledge pre-training model can obtain rich context information from articles in the training text, dynamic adjustment can be carried out, and the flexibility is high.

Description

technical field [0001] The present disclosure relates to the technical fields of language, natural language processing, and deep learning in the field of computer technology, and in particular to a training method, device, electronic equipment, storage medium, and computer program product of a knowledge pre-training model. Background technique [0002] At present, most of the models do not have common sense reasoning ability. For example, if the question is "using what can use ink to copy a document on paper", the answer includes pen, copier, carbon paper, notebook, and people can correctly choose the answer of copier based on common sense, However, due to the high co-occurrence frequency of carbon paper with the carbon and paper in the question, the model is likely to choose the answer of carbon paper, causing the model to output wrong results. The model training method in the related art cannot realize the joint training of common sense learning and semantic learning, and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/279G06F40/30G06F16/36G06K9/62G06N20/20
CPCG06F40/279G06F40/30G06F16/367G06N20/20G06F18/214G06N5/022G06N20/00G06N5/04
Inventor 庞超王硕寰孙宇李芝
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products