Unlock instant, AI-driven research and patent intelligence for your innovation.

Text classification method and system

A text classification and text technology, applied in the direction of text database clustering/classification, unstructured text data retrieval, biological neural network model, etc. Extracting long sequence semantic information and other issues to achieve the effect of improving accuracy

Inactive Publication Date: 2019-06-11
BEIJING UNIV OF POSTS & TELECOMM
View PDF4 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Recurrent neural networks are slow to train because the hidden layers are recurrent
Moreover, in long sequences, as time goes by, the features extracted by the hidden layer are more biased towards the later input information, which will affect the extraction effect of RNN on global semantic information, while the features of each part of the input information extracted in CNN are equal. or no preference
[0005] In summary, CNN is only good at extracting local semantic information, and is not suitable for extracting long sequence semantic information. RNN is suitable for analyzing long sequence global semantic information, but as time goes by, the features extracted by the hidden layer are more inclined to the later input information, which affects The effect of extracting global semantic information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method and system
  • Text classification method and system
  • Text classification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0028] In order to better describe the text classification method provided by the embodiment of the present invention, the structure of the text classification model is first described. figure 1 A schematic structural diagram of a text classification model provided by an embodiment of the present invention, such as figure 1 As show...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a text classification method and system. The method comprises the following steps: obtaining a to-be-classified text; inputting the to-be-classified text intoa text classification model, and obtaining the category of the to-be-classified text according to the output result of the text classification model; wherein the text classification model is formed based on a plurality of convolutional neural networks and a plurality of recurrent neural networks, and is obtained by training based on a sample text and a predetermined text category label. The embodiment of the invention provides a method and system. A CNN and an RNN are fused to form a text classification model; texts are classified through the text classification model; The method overcomes thedefect that the CNN is not suitable for extracting long sequence semantic information and the defect that the RNN has an extraction effect on long sequence global semantic information, and can be suitable for extracting local semantic information and long sequence global semantic information, so that the text classification accuracy is greatly improved.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of natural language processing for artificial intelligence deep learning, and in particular, to a text classification method and system. Background technique [0002] Deep learning is the best way to achieve text classification tasks, among which Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) are the most widely used feature extraction methods in deep learning. [0003] CNN can be traced back to the proposal of the back propagation (BP) algorithm in 1987, and then Yann LeCun used it in multi-layer neural networks in 1989, until Yann LeCun proposed the LeNet-5 model in 1998, CNN gradually revealed prototype. CNN can be divided into three parts: input layer, hidden layer and output layer. In natural language processing, the input information of the input layer is represented as an input sequence by a pre-trained word embedding model or word vector. Commonly used...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06N3/04
Inventor 双锴胡皓张文涛姚云腾
Owner BEIJING UNIV OF POSTS & TELECOMM