Text processing method and device, computer readable storage medium and electronic equipment

A text processing and text technology, applied in the fields of text processing devices, computer-readable storage media and electronic equipment, and text processing methods, can solve problems such as data imbalance, poor accuracy, and lack of optimization, so as to improve accuracy and stability performance, improve efficiency and accuracy, and avoid the effect of manual labeling

Active Publication Date: 2020-10-02
TENCENT TECH (SHENZHEN) CO LTD
View PDF7 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, the multi-label automatic labeling of text is realized through the classification model. When training the classification model, the samples (text, image or audio) are converted into feature vectors for multi-label classification learning, but the existing methods rely on complete The training data set requires expensive human labeling as support, which greatly limits the expansion of the field and the iteration speed of the project. In addition, the existing methods have not optimized the problem of data imbalance, resulting in the classification model obtained after training. The accuracy of multi-label classification is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text processing method and device, computer readable storage medium and electronic equipment
  • Text processing method and device, computer readable storage medium and electronic equipment
  • Text processing method and device, computer readable storage medium and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete and will fully convey the concept of example embodiments to those skilled in the art.

[0041] Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided in order to give a thorough understanding of embodiments of the present disclosure. However, those skilled in the art will appreciate that the technical solutions of the present disclosure may be practiced without one or more of the specific details, or other methods, components, means, steps, etc. may be employed. In other instances, well-known methods, ap...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a text processing method and device, a computer storage medium and electronic equipment, and relates to the field of artificial intelligence. The method comprises the followingsteps: obtaining text to be processed, inputting the text to be processed into a multi-label classification model, wherein the multi-label classification model is obtained by training based on an unbalanced text sample set and an unbalanced attenuation loss function, the unbalanced text sample set is a text sample set in which the number of label positive samples and the number of label negative samples are unbalanced, and the unbalanced attenuation loss function comprises a first loss part, a second loss part and recall loss; performing attribute extraction on the to-be-processed text throughthe multi-label classification model to obtain a label corresponding to the to-be-processed text; and obtaining a corresponding entity from the to-be-processed text according to the label, and constructing a triad according to the label and the entity so as to update the knowledge graph according to the triad. According to the invention, the on-call rate of the text label can be improved and thecost is reduced.

Description

technical field [0001] The present disclosure relates to the technical field of artificial intelligence, and in particular, to a text processing method, a text processing device, a computer-readable storage medium, and electronic equipment. Background technique [0002] With the rapid development of science and technology and artificial intelligence, text classification has become an important aspect. For text, there may be more than one label corresponding to it, and there may be multiple. For example, an article introducing a certain character is likely to Describe the characters' biography, family relationship, social contribution, etc., so it should at least contain tags such as father, mother, place of birth, and residence. It can be seen that multi-label classification can provide richer classification information and provide greater help for possible subsequent applications, such as text classification management, monitoring, filtering, and so on. [0003] At present...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06N3/04G06N20/00
CPCG06F16/35G06N20/00G06N3/045
Inventor 张倩汶闫昭饶孟良曹云波
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products