Text multi-label classification method and system based on attention mechanism and GCN

A classification method and attention technology, applied in the direction of text database clustering/classification, neural learning method, unstructured text data retrieval, etc. Classification accuracy and other issues to achieve the effect of improving feature extraction capabilities

Pending Publication Date: 2021-04-27
HUNAN UNIV
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In view of the above defects or improvement needs of the prior art, the present invention provides a text multi-label classification method and system based on attention mechanism and GCN. part, leading to technical problems that affect the accuracy of text classification, and due to the convolution operation of convolutional neural networks, it is easy to ignore the semantic relationship of text, thus affecting the technical problems of text classification accuracy, and because most of them adopt data-driven methods, It ignores the role of different parts of the text in predicting different labels, which leads to technical problems affecting the accuracy of text classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text multi-label classification method and system based on attention mechanism and GCN
  • Text multi-label classification method and system based on attention mechanism and GCN
  • Text multi-label classification method and system based on attention mechanism and GCN

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0071] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0072] The basic idea of ​​the present invention is to use the attention mechanism to construct the semantic correlation between texts, words, and tags. On the one hand, the text and tag information are summarized to form a new text word representation, and the text features are extracted more fully. On the other hand, On the one hand, text and word information are aggregated to form a new label...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text multi-label classification method based on an attention mechanism and a GCN, and the method comprises the steps: obtaining a to-be-classified text, carrying out the preprocessing of the to-be-classified text, and converting the preprocessed to-be-classified text into a multi-dimensional vector through a Glove pre-training word vector; and inputting the obtained multi-dimensional vector into a pre-trained classification model to obtain a classification result of the to-be-classified text. According to the method, semantic correlation among texts, words and labels is constructed by utilizing an attention mechanism; on one hand, texts and label information are summarized to form a new text word representation form, and text feature extraction is more sufficiently carried out; on the other hand, texts and word information are summarized to form a new label representation form, and a graph neural network is used to carry out correlation modeling of the labels. And the text multi-label classification effect is improved from two angles.

Description

technical field [0001] The present invention belongs to the technical field of natural language processing, and more specifically relates to a text multi-label classification method and system based on an attention mechanism and a Graphic convolutional network (GCN for short). Background technique [0002] With the development of society and network technology, there are massive information resources in the form of text. How to effectively classify these texts and quickly, accurately and comprehensively mine effective information has become one of the hotspots in the field of natural language processing research. Text classification refers to determining a category for each document in a document collection, and there are a wide range of application scenarios. As a difficulty in the field of text classification, multi-label classification has also attracted a lot of attention. Most of the current multi-label classification methods consider two parts: [0003] 1. Feature ex...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/30G06F16/35G06N3/04G06N3/08
CPCG06F40/30G06F16/35G06N3/08G06N3/045
Inventor 刘孝炎肖正郭修远王立峰
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products