Text classification method mixing long-short-term memory network with convolutional neural network

A convolutional neural network, long-term and short-term memory technology, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve problems such as high cost, time-consuming and labor-intensive, and no consideration of contextual information of language units in sentences, etc., to achieve Good versatility, improved accuracy, good effect

Inactive Publication Date: 2017-09-15
SOUTH CHINA UNIV OF TECH
View PDF4 Cites 88 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Before the 1990s, automatic text classification was mainly based on knowledge engineering, that is, manual classification by professionals, and its disadvantages were high cost, time-consuming and laborious
Existing research and applications have proved that recurrent neural networks are suitable for learning long-term dependencies betwee

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method mixing long-short-term memory network with convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0025] The present embodiment provides a text classification method of mixing long-short-term memory network and convolutional neural network, and the method comprises the following steps:

[0026] Step 1. Preprocessing the sentences in the text, including punctuation filtering, abbreviation completion, deleting spaces, sentence segmentation and illegal character filtering, combined with the length distribution and mean square error of sentences in the training corpus, after determining the sentence length threshold Form a unified sentence length, use the pre-trained word vector table to obtain the vectorized representation of each word in the input text, and form a continuous and dense real vector matrix;

[0027] Step 2. For the input sentence word vector, a forward LSTM network is used to learn the above information of each word and a reverse LSTM network is used to learn the following information of each word, and the learning results are serially merged, so as to include ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text classification method mixing a long-short-term memory network with a convolutional neural network. The advantage of the bidirectional long-short-term memory network in the aspect of learning context information of a text and the advantage of the convolutional neural network in the aspect of learning local features of the text are fully combined; the context information of words is learnt by utilizing the bidirectional long-short-term memory network; the local features of word vectors of the context information are further learnt and extracted through the convolutional neural network; contexts of the local features are learnt by utilizing the bidirectional long-short-term memory network to form a fixed-dimension output; and finally classification output is performed through a multilayer sensor. The accuracy of model classification can be further improved; relatively high universality is achieved; and good effects are achieved in a plurality of tested corpus libraries.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to a text classification method of a hybrid long-short-term memory network and a convolutional neural network. Background technique [0002] Automatic text classification based on machine learning is the most popular research direction in the field of natural language processing in recent years. and in-depth applications. The so-called automatic text classification refers to the process of automatically determining the text category after analyzing the content of the text by using machine learning under the premise of a given classification system. Before the 1990s, automatic text classification was mainly based on knowledge engineering, that is, manual classification by professionals, and its disadvantages were high cost, time-consuming and labor-intensive. Since the 1990s, many researchers have begun to apply various statistical methods and machine learning methods to ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06N3/04G06N3/08
CPCG06F16/35G06N3/08G06N3/045
Inventor 苏锦钿霍振朗欧阳志凡
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products