Unlock instant, AI-driven research and patent intelligence for your innovation.

A text classification method and device

A text classification and text technology, applied in text database clustering/classification, unstructured text data retrieval, biological neural network model, etc., can solve the problem of low accuracy and achieve the effect of improving accuracy

Active Publication Date: 2020-09-29
BEIJING SOHU NEW MEDIA INFORMATION TECH
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In view of this, the object of the present invention is to provide a text classification method and device, to solve the problem of low accuracy in the method of realizing text classification based on CNN model in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A text classification method and device
  • A text classification method and device
  • A text classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0068] This embodiment discloses a text classification method, which is applied in the scene of long text classification, for example, the scene of news classification, see figure 1 , the embodiment includes the following steps:

[0069] S101. Preprocessing the text to be classified to obtain multiple sentences;

[0070] When the text to be...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a text classification method and device. On the basis of the pre-established CNN classification model, the method of initializing the weight of the convolutional layer is improved. Specifically, the weight is initialized according to the Gaussian distribution. Compared with the existing CNN-based The classification model realizes the method of text classification, which improves the accuracy of classification results. And compared with Naive Bayesian, machine learning algorithms such as SVM also improve the accuracy of classification results.

Description

technical field [0001] The present invention relates to the technical field of classification, in particular to a text classification method and device. Background technique [0002] The method for implementing text classification in the prior art is: extracting text features of the text to be classified, and classifying the text to be classified according to the text features. [0003] Based on the text classification methods disclosed in the prior art, when classifying news, since the news is a long text, when extracting the text features of the news to be classified, it is often necessary to invest a lot of manpower and time to design effective text features. Helping with classification is time-consuming and labor-intensive. [0004] Since deep learning can automatically learn text features, it can solve the problem of difficult text feature extraction when classifying long texts such as news. The convolutional neural network model (CNN) is commonly used in deep learnin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06N3/04
CPCG06F16/355G06N3/045
Inventor 陈嘉慧刘海龙郭亚南
Owner BEIJING SOHU NEW MEDIA INFORMATION TECH