Text classification method based on word sense disambiguation convolutional neural network

A neural network and text classification technology, applied in the field of text classification based on word sense disambiguation convolutional neural network, can solve the problems of slow text classification efficiency, low text classification accuracy, complex text classification methods, etc., to improve text classification efficiency , Optimizing the effect of text classification methods

Inactive Publication Date: 2019-10-11
厦门美域中央信息科技有限公司
View PDF7 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The current text classification method is relatively complex, the efficien

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method based on word sense disambiguation convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in combination with specific embodiments and with reference to the accompanying drawings. It should be understood that these descriptions are exemplary only, and are not intended to limit the scope of the present invention. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concept of the present invention.

[0037] like figure 1 As shown, a kind of text classification method based on word sense disambiguation convolutional neural network that the present invention proposes, comprises the following steps:

[0038] S1. Configure an ambiguous thesaurus with determined word meanings;

[0039] S2. Obtain relevant files, extract text content from the files, and perform word segmentation processing on each sentence in the text;

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text classification method based on a word sense disambiguation convolutional neural network. The method comprises the following steps of configuring an ambiguous word bank with determined word senses; obtaining a related file, extracting text content from the file, and performing word segmentation processing on each statement in the text; determining the part-of-speech of each word in the statement; determining a disambiguation target word; determining the meaning of the target word and performing disambiguation processing; performing word segmentation processing andstop word removal processing on original statements contained in the disambiguated text to obtain target statements corresponding to the original statements; determining the criticality of words in the target statement; determining the criticality of the target statement; sorting the statements according to the criticality of the statements to obtain a target text; and utilizing the trained textclassification model based on the convolutional neural network to classify the target text. According to the method, text classification can be carried out based on the word sense disambiguation convolutional neural network, the text classification method is optimized, the text classification efficiency and the text classification accuracy are improved, and time and labor are saved.

Description

technical field [0001] The invention relates to the technical field of text classification, in particular to a text classification method based on word sense disambiguation convolutional neural network. Background technique [0002] With the increasing development of network media and the increasing number of netizens, a large amount of text data is constantly being generated. How to deal with the huge text data and correctly classify it is an urgent problem to be solved. Text classification uses existing data to train classifiers, and Use this classifier to test documents to determine the category of each document. Correct text classification can enable users to find the information they need faster and browse documents more conveniently. Automatic text classification refers to training text with category marks. That is, train a text classifier, and then use the classifier to test unknown categories of text for recognition; [0003] In the existing technology, text classif...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/35G06F16/36G06F16/33G06F17/21G06F17/27
CPCG06F16/35G06F16/374G06F16/3347G06F40/117G06F40/211G06F40/289
Inventor 肖清林
Owner 厦门美域中央信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products