Text topic classification method and system

A classification method and classification system technology, applied in the field of classification methods and systems of text topics, can solve problems such as reducing classification accuracy, and achieve the effects of avoiding classification error rate, improving accuracy rate and recall rate, and simple calculation.

Active Publication Date: 2015-11-11
SHANGHAI GAOXIN COMP SYST CO LTD
View PDF5 Cites 49 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0030] Moreover, the Bayesian network-based classification method assumes that the influence of an attribute value on a given type is independent of the va

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text topic classification method and system
  • Text topic classification method and system
  • Text topic classification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] In order to make the object, technical solution and advantages of the present invention clearer, various embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. However, those of ordinary skill in the art can understand that, in each implementation manner of the present invention, many technical details are provided for readers to better understand the present application. However, even without these technical details and various changes and modifications based on the following implementation modes, the technical solution claimed in each claim of the present application can be realized.

[0054] The first embodiment of the present invention relates to a method for classifying text topics, the specific process is as follows figure 2 shown, including the following steps:

[0055] Step 201, collecting corpus. Specifically, the crawler technology can be used to collect texts (corpus) of various topic types, an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to the technical field of text topic classification, and discloses a text topic classification method and system. According to the present invention, the text topic classification method comprises the following steps of: collecting corpuses, wherein the corpuses comprise texts of various types of topics; performing word segmentation on the corpuses, and performing text feature extraction on the corpuses after word segmentation to obtain feature vectors of the texts of the various types of topics; adjusting feature values in the feature vectors of the texts of the various types of topics according to a dynamic logarithmic excitation function, to obtain new feature vectors of the texts of the various types of topics; and according to similarity between a text to be classified and the new feature vectors of the texts of the various types of topics, classifying the text to be classified and determining a topic type of the text to be classified. In this way, text classification becomes more accurate.

Description

technical field [0001] The present invention relates to text topic classification technology, in particular to a text topic classification method and system. Background technique [0002] With the development of Internet information, the demand for text classification is also increasing day by day. Text classification can solve the phenomenon of online information clutter to a certain extent, and can facilitate users to accurately locate the information they need. Text classification is to use a large number of labeled training samples to map the text to be classified into the specified category through a classification model or function. [0003] Text classification methods can be divided into rule-based classification methods and statistical-based classification methods. Among them, the rule-based classification method generates a rule base by learning the training set, and optimizes it, and the finally obtained rule base is a rule classifier. However, the disadvantage o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/35
Inventor 周诚赵世亭赵营营
Owner SHANGHAI GAOXIN COMP SYST CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products