Unlock instant, AI-driven research and patent intelligence for your innovation.

Text classification method and device

A text classification and text technology, applied in the computer field, can solve the problem of low classification efficiency

Inactive Publication Date: 2015-07-01
INSPUR GROUP CO LTD
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of this, the present invention provides a text classification method and device to solve the problem of low classification efficiency in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method and device
  • Text classification method and device
  • Text classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the drawings in the embodiments of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0039] like figure 1 As shown, the embodiment of the present invention provides a text classification method, which pre-sets the dimension threshold, and the method may include the following steps:

[0040] Step 101: Determine the texts to be classified and the multi-dimensional vectors corresponding to each text.

[0041] Step 102: According to the preset dimension threshold, and, the multi-dimensional vector corresponding to each text, obtain the fir...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a text classification method and device. The method includes the following step that texts to be classified and multidimensional vectors corresponding to each text are determined; the first dimensional vector corresponding to each text is obtained and analyzed to obtain multi-class themes corresponding to the texts to be classified; one unclassified text is selected from the texts to be classified, the first dimensional vector corresponding to the selected text and each second dimensional vector are calculated to obtain cosine similarity, the classification theme corresponding to the maximum cosine similarity serves as the theme of the selected text, and the step continues to be executed until all the texts to be classified are classified. According to the scheme, text classification efficiency is improved.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a text classification method and device. Background technique [0002] Text classification technology has applications in many fields. For example, texts are classified, and the classified texts are used to guide the training of translation models in machine translation. It can be seen that the accuracy of text classification is very important. Classified texts with high accuracy can be used in other However, if the accuracy of text classification is not enough, it will have a negative impact on the applications that use these classified texts. [0003] In the existing text classification methods, the training corpus is usually used for classifier training, and then the trained classifier is used to classify the text, and the classification efficiency is low. Contents of the invention [0004] In view of this, the present invention provides a text classification method and d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 于振梅刘艺张连超刘宇张鹏
Owner INSPUR GROUP CO LTD