Text classification method and device

A text classification technology applied in the field of text processing. It addresses the poor classification performance of traditional text classification schemes, which prevents the actual viewpoint category of a text from being determined accurately, and thereby improves classification accuracy.

Pending Publication Date: 2020-05-08
BEIJING GRIDSUM TECH CO LTD

AI Technical Summary

Problems solved by technology

[0005] The embodiments of the present invention provide a text classification method and device that at least solve the technical problem that traditional text classification schemes classify poorly, so that the actual viewpoint category of a text cannot be determined accurately.


Examples

Embodiment 1

[0020] According to an embodiment of the present invention, an embodiment of a text classification method is provided. It should be noted that the steps shown in the flowcharts of the accompanying drawings can be executed in a computer system, for example as a set of computer-executable instructions, and that, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in a different order.

[0021] Figure 1 is a flowchart of a text classification method according to an embodiment of the present invention. As shown in Figure 1, the method includes the following steps:

[0022] Step S102, obtaining the text to be classified by preprocessing the text, wherein the preprocessing includes at least one of the following: word segmentation, part-of-speech tagging, and stop word filtering;

[0023] Step S104, inputting the above-mentioned text to be classified into the target d...
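The preprocessing in step S102 can be illustrated with a minimal sketch. It covers only two of the listed operations (word segmentation and stop word filtering), and everything in it is an assumption made for illustration: the regex-based segmenter stands in for a real word segmenter, and the `STOP_WORDS` set and function names are hypothetical rather than taken from the patent.

```python
import re

# Hypothetical stop-word list; a real system would load a curated one.
STOP_WORDS = {"the", "a", "an", "is", "of", "to", "and"}


def segment(text):
    """Split text into lowercase word tokens (stand-in for a real segmenter)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def filter_stop_words(tokens, stop_words=STOP_WORDS):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in stop_words]


def preprocess(text):
    """Segment the raw text, then filter stop words, yielding the text to be classified."""
    return filter_stop_words(segment(text))


print(preprocess("The price of the product is too high."))
# prints ['price', 'product', 'too', 'high']
```

For Chinese text, the regex segmenter would need to be replaced by a dedicated word segmentation tool, since whitespace and punctuation do not delimit words there.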

Embodiment 2

[0072] According to an embodiment of the present invention, a device embodiment for implementing the above text classification method is also provided. Figure 4 is a schematic structural diagram of a text classification device according to an embodiment of the present invention. As shown in Figure 4, the text classification device includes a preprocessing module 40, an input module 42 and a classification module 44, wherein:

[0073] The preprocessing module 40 is used to obtain the text to be classified by preprocessing the text, wherein the preprocessing includes at least one of the following: word segmentation, part-of-speech tagging, and stop word filtering; the input module 42 is used to input the text to be classified into the target depth classification model, wherein the target depth classification model is determined by training and learning the training samples of the identifi...
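The three-module structure can be sketched as a small pipeline object. This is only an illustration of how the modules might be composed: the class name, field names, and the toy stand-in functions are all hypothetical, and the keyword-based scoring below is a placeholder, not the trained model the patent describes.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class TextClassificationDevice:
    # Each field plays the role of one module; the names are hypothetical.
    preprocess: Callable[[str], List[str]]          # role of preprocessing module 40
    feed_model: Callable[[List[str]], List[float]]  # role of input module 42
    decide: Callable[[List[float]], str]            # role of classification module 44

    def run(self, raw_text: str) -> str:
        tokens = self.preprocess(raw_text)
        scores = self.feed_model(tokens)
        return self.decide(scores)


# Toy stand-ins so the sketch runs; a real device would plug in a trained model.
device = TextClassificationDevice(
    preprocess=lambda text: text.lower().split(),
    feed_model=lambda tokens: [float("good" in tokens), float("bad" in tokens)],
    decide=lambda scores: "positive" if scores[0] >= scores[1] else "negative",
)

print(device.run("The service was good"))  # prints positive
```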

Abstract

The invention discloses a text classification method and device. The method comprises the steps of preprocessing a text to obtain a to-be-classified text; inputting the to-be-classified text into a target depth classification model, the target depth classification model being determined by training and learning a training sample with an identified viewpoint category; performing vectorization representation on the to-be-classified text according to the target depth classification model, determining a word representation vector, a sentence representation vector and an article representation vector of the to-be-classified text; and classifying the to-be-classified text based on the word representation vector, the sentence representation vector and the article representation vector, and determining a classification result of the to-be-classified text, the classification result at least comprising a viewpoint category. The technical problem that the actual viewpoint category of the text cannot be accurately determined due to the fact that a traditional text classification scheme is poor in classification effect is solved.
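The flow from word, sentence and article representation vectors to a viewpoint category can be sketched as follows. This is a rough illustration under stated assumptions, not the patented model: the seeded pseudo-random word vectors, the mean and max pooling, the category names, and the untrained linear layer are all stand-ins chosen so that the example is self-contained, whereas the patent's model learns its representations from training samples with identified viewpoint categories.

```python
import math
import random

DIM = 8  # hypothetical embedding size
CATEGORIES = ["positive", "negative", "neutral"]  # hypothetical viewpoint categories


def word_vector(word):
    """Deterministic pseudo-random vector per word (stand-in for learned embeddings)."""
    rng = random.Random(word)
    return [rng.uniform(-1.0, 1.0) for _ in range(DIM)]


def mean(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def represent(article):
    """article: non-empty list of sentences, each a non-empty list of tokens."""
    word_vecs = [[word_vector(w) for w in sent] for sent in article]
    sent_vecs = [mean(ws) for ws in word_vecs]   # one vector per sentence
    article_vec = mean(sent_vecs)                # one vector for the whole text
    return word_vecs, sent_vecs, article_vec


def classify(article, weights):
    """Score categories from features built on all three representation levels."""
    word_vecs, sent_vecs, article_vec = represent(article)
    pooled_words = mean([v for ws in word_vecs for v in ws])
    pooled_sents = [max(col) for col in zip(*sent_vecs)]    # element-wise max
    features = pooled_words + pooled_sents + article_vec    # concatenation
    scores = [sum(w * x for w, x in zip(row, features)) for row in weights]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]              # stable softmax
    probs = [e / sum(exps) for e in exps]
    best = max(range(len(probs)), key=probs.__getitem__)
    return CATEGORIES[best], probs


# Untrained random weights, used here only to make the sketch executable.
_rng = random.Random(0)
WEIGHTS = [[_rng.uniform(-1.0, 1.0) for _ in range(3 * DIM)] for _ in CATEGORIES]

label, probs = classify([["price", "too", "high"], ["service", "good"]], WEIGHTS)
print(label, [round(p, 3) for p in probs])
```

Because the weights are untrained, the predicted label carries no meaning; the point is only the shape of the computation, in which the classifier sees features from all three levels at once.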

Description

Technical field

[0001] The present invention relates to the field of text processing, in particular to a text classification method and device.

Background

[0002] Text classification is a basic task in natural language processing, with a wide range of application scenarios, including spam classification, sentiment analysis, news topic classification, question classification in automatic question answering systems, etc. Traditional text classification schemes are mainly based on statistical classification methods, and their effect depends mainly on the expressive ability of the features, that is, on whether the features contain enough information for classification. The selected features are usually word frequency, TF-IDF, etc.

[0003] The main disadvantage of constructing feature vectors through traditional text representation methods (for example, vector space models) is that contextual relations are ignored, similarity calculations are performed o...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/35, G06F40/289, G06F40/30
Inventor 徐文斌
Owner BEIJING GRIDSUM TECH CO LTD