Model training method and text information processing method, system and device and storage medium

A technology of training text and training methods, applied in the fields of electrical digital data processing, biological neural network model, natural language data processing, etc., can solve problems such as high difficulty, high acquisition cost, and low labeling efficiency

Pending Publication Date: 2021-09-14
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, training a deep learning model requires a large number of training samples. These training samples need to be labeled manually. For keyword labeling, it is difficult and the labeling efficiency is relatively low, so the acquisition cost is high
The same words have different key degrees in different contexts. Therefore, if you need to obtain a model that can accurately predict the weight of words in sentences in different contexts, you need to use a large number of training samples, and the cost comparison high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method and text information processing method, system and device and storage medium
  • Model training method and text information processing method, system and device and storage medium
  • Model training method and text information processing method, system and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0076] In the following description, references to "some embodiments" describe a subset of all possible embodiments, but it is understood that "some embodiments" may be the same subset or a different subset of all possible embodiments, and Can be combined with each other without conflict.

[0077] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field to which this application belongs. The terms used herein are only for the purpose of describing the embodiments of the present application, and are not intended to limit the present application.

[0078] Before further describing the embodiments of the present application in detail, the nouns and terms involved in the embodiments of the present application are described, and the nouns and terms involved in the embodiments of the present application are applicable to the following explanations.

[0079] Artificial Intell...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a training method of a text information processing model, a text information processing method, system and device and a storage medium, which can be applied to the field of artificial intelligence; the training method comprises the following steps: coding a first training text to obtain first semantic information; encoding the first candidate word to obtain second semantic information; predicting classification of the first training text according to the first semantic information to obtain a first classification result; predicting the weight of the first candidate word according to the first semantic information and the second semantic information; determining a first loss value according to the first classification result and the type label of the first training text; determining a second loss value according to the weight of the first candidate word and the weight label of the first candidate word; and training the text information processing model according to the first loss value and the second loss value. According to the method, the training cost can be reduced or higher model precision can be obtained under the condition of the same training cost.

Description

technical field [0001] The present application relates to artificial intelligence technology, especially a text information processing model training method, text information processing method, system, device and storage medium. Background technique [0002] At present, word weighting tasks are mostly completed in two ways: statistics and text classification. The statistics are unsupervised, and the representative method is TF-IDF (term frequency–inverse document frequency, a commonly used weighting technology for information retrieval and data mining. ), mutual information MI (mutual information, is a commonly used information measure in information theory). There is a supervised method, represented by deep learning. Compared with the statistical method, the deep learning model can learn the relationship between keywords and sentences, so it is more accurate in the task of keyword extraction. [0003] However, training a deep learning model requires a large number of train...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F40/30G06N3/04
CPCG06F40/289G06F40/30G06N3/044G06N3/045
Inventor 黄剑辉
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products