Text classification method and system

A text classification technology, applied in semantic analysis, special data processing applications, instruments, etc. It addresses the low accuracy of existing text classification methods, which cannot express the semantic information of text data, and it improves classification accuracy while reducing noise.

Active Publication Date: 2017-09-19
IFLYTEK CO LTD

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a text classification method and system to solve the problem that the existing text classi

Embodiment Construction

[0055] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0056] Figure 2 shows a flow chart of the text classification method provided by the embodiment of the present invention, which includes the following steps:

[0057] Step S01: pre-construct a text classification model that performs text classification based on classification features. The classification features include any one or more of character features and word features, and further include any one or more of part-of-speech features and dependency syntax features.
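The feature-selection rule in step S01 can be read as a simple constraint: at least one of the character/word features, plus at least one of the part-of-speech/dependency syntax features. The following is a minimal sketch of that reading; the class name `FeatureSelection` and its field names are hypothetical and do not come from the patent text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeatureSelection:
    """Hypothetical container for the feature types enabled for a model."""
    character: bool = False
    word: bool = False
    part_of_speech: bool = False
    dependency_syntax: bool = False

    def is_valid(self) -> bool:
        # At least one of: character features, word features.
        has_lexical = self.character or self.word
        # And at least one of: part-of-speech, dependency syntax features.
        has_syntactic = self.part_of_speech or self.dependency_syntax
        return has_lexical and has_syntactic


# A word + part-of-speech combination satisfies the rule;
# character + word alone does not, because no syntactic feature is present.
word_and_pos = FeatureSelection(word=True, part_of_speech=True)
lexical_only = FeatureSelection(character=True, word=True)
```

Under this reading, `word_and_pos.is_valid()` holds while `lexical_only.is_valid()` does not.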

[0058] In this embodiment, the text classification model can use multiple different types of classification features as input, and according to the different dimensions of the preset classification features, different input windows can b...

Abstract

The invention discloses a text classification method and system. The method comprises the steps of pre-building a text classification model for performing text classification based on classification features, wherein the classification features include any one or more of the following features: a character feature and a word feature, and further include any one or more of the following features: a part-of-speech feature and a dependency syntax feature; obtaining to-be-classified text data; extracting the classification features of the to-be-classified text data; and inputting the classification features to the text classification model so as to obtain the text type of the to-be-classified text data. According to the method provided by the invention, semantic information of the text data can be expressed from multiple perspectives by use of the features such as the character feature, the word feature, the part-of-speech feature, the dependency syntax feature and the like, so that the information of the text data can be expressed more completely, and the accuracy of an obtained prediction result is higher when the classification features are used for performing text type prediction.
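The steps in the abstract (obtain the text, extract the classification features, input them to the model, read off the text type) can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the `Token` annotations are assumed to come from an external tagger and dependency parser, and `OverlapClassifier` is a toy stand-in that scores feature overlap, whereas the patent's actual model structure is not shown in this excerpt. All names are hypothetical.

```python
from collections import Counter
from typing import NamedTuple


class Token(NamedTuple):
    """One pre-annotated token (annotations assumed to come from a parser)."""
    word: str
    pos: str    # part-of-speech tag
    head: int   # index of the governing token, -1 for the root
    dep: str    # dependency relation to the head


def extract_features(tokens):
    """Collect character, word, part-of-speech and dependency features."""
    feats = Counter()
    for tok in tokens:
        for ch in tok.word:
            feats["char=" + ch] += 1
        feats["word=" + tok.word] += 1
        feats["pos=" + tok.pos] += 1
        head_word = tokens[tok.head].word if tok.head >= 0 else "ROOT"
        feats[f"dep={tok.dep}({head_word},{tok.word})"] += 1
    return feats


class OverlapClassifier:
    """Toy stand-in model: picks the label whose profile overlaps most."""

    def __init__(self):
        self.profiles = {}

    def fit(self, samples):
        # Accumulate one feature-count profile per text type.
        for tokens, label in samples:
            profile = self.profiles.setdefault(label, Counter())
            profile.update(extract_features(tokens))
        return self

    def predict(self, tokens):
        feats = extract_features(tokens)

        def score(label):
            profile = self.profiles[label]
            return sum(min(n, profile[name]) for name, n in feats.items())

        return max(self.profiles, key=score)


# Tiny hand-annotated examples (illustrative data only).
sports = [Token("team", "NOUN", 1, "nsubj"), Token("wins", "VERB", -1, "root")]
finance = [Token("stocks", "NOUN", 1, "nsubj"), Token("fall", "VERB", -1, "root")]
model = OverlapClassifier().fit([(sports, "sports"), (finance, "finance")])

query = [
    Token("team", "NOUN", 1, "nsubj"),
    Token("wins", "VERB", -1, "root"),
    Token("again", "ADV", 1, "advmod"),
]
predicted = model.predict(query)
```

Keeping each feature type under its own prefix (`char=`, `word=`, `pos=`, `dep=`) means the four perspectives stay distinguishable inside a single feature vector, which is one simple way to combine them.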

Description

Technical field

[0001] The invention relates to the field of natural language processing, in particular to a text classification method and system.

Background

[0002] With the continuous development of information technology and the rapid spread of the Internet, people face more and more information. This wealth of information also brings a problem: a large amount of non-target information is mixed in, so users must browse all of it to select what is useful or interesting, which is very inconvenient. Most of this information is text, so how to find the required text data quickly and efficiently has become an urgent problem to be solved.

[0003] To solve the above problems, automatic text classification technology has been developed. Text classification refers to the process of judging and classifying a large number of texts into one or more p...


Application Information

IPC(8): G06F17/27
CPC: G06F40/30, G06F40/289
Inventor: 胡加学, 孙瑜声, 金重九, 赵乾
Owner: IFLYTEK CO LTD