Text classification method

A text classification and text information technology, applied in the field of deep learning, can solve the problems of losing valuable information and inaccurate classification results, and achieve the effect of avoiding loss and improving classification accuracy

Active Publication Date: 2018-11-16
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF4 Cites 66 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The existing classification method based on deep learning will lead to the loss of a large amount of valuable information, which will make the classification results inaccurate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method
  • Text classification method
  • Text classification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the purpose, technical solution, design method and advantages of the present invention clearer, the present invention will be further described in detail through specific embodiments in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0042] According to an embodiment of the present invention, a text information classification method is provided. In short, the method includes performing word segmentation and word segmentation processing on the text information, and performing high-dimensional feature representation on the word segmentation results and word segmentation results, To construct a training sample set; use the training sample set to train a deep learning model to obtain a text classification model; apply the text classification model to classify text. Specifically, see figure 1 As shown, the me...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method of constructing a text classification model. The method comprises the steps of: constructing a training sample set according to structure features of characters, wordsand sentences of text information, wherein each piece of sample data in the training sample set corresponds to a feature matrix A of a piece of text information about the words and a feature matrix Babout the characters and a category vector O corresponding to the piece of text information, and the dimension number of O is the same as a category number; and using the feature matrix A about the words and the feature matrix B about the characters in the training matrix set as input and the corresponding category vector O as output to train a deep learning model to obtain the text classificationmodel. Classification is carried out according to the classification model constructed by the method, an accuracy rate of text classification can be improved, and the method is particularly suitablefor use in short-text classification.

Description

technical field [0001] The invention relates to the technical field of deep learning, in particular to a text classification method. Background technique [0002] Text classification refers to determining a category for each document in a document collection according to predefined subject categories. Text classification technology has a wide range of applications in daily life, for example, filtering junk messages and emails, grouping news and so on. [0003] With the rapid development of social media such as Weibo and WeChat, short texts have become an important form of information. Short texts usually have the following characteristics: the number of words is small, and the length of short texts is usually relatively short, generally within 200 words, so , the effective information contained is also very little; the update is fast, most of the information in the form of short text on the Internet is updated in real time, and the refresh rate is very fast, for example, ch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 赵莉姜松浩张程赵晓芳段东圣杜翠兰
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products