Method for classifying texts on basis of nave Bayes

A text classification and text technology, applied in the direction of text database clustering/classification, unstructured text data retrieval, instruments, etc., can solve the problem of unsatisfactory text classification algorithm, and achieve good practical application value and good performance.

Active Publication Date: 2017-11-24
MEISHAN POWER SUPPLY CO STATE GRID SICHUAN ELECTRIC POWER CO
View PDF3 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a text classification method based on naive Bayesian, which solves the technical problem that the existing text classification algorithm is not id

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for classifying texts on basis of nave Bayes
  • Method for classifying texts on basis of nave Bayes
  • Method for classifying texts on basis of nave Bayes

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The present invention provides a text classification method based on naive Bayesian, which solves the technical problem that the existing text classification algorithm is not ideal. Assuming this deficiency independently, the performance of the method is better, and it has a good practical application value in the text classification problem of power user demands.

[0040] In order to understand the above-mentioned purpose, features and advantages of the present invention more clearly, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments. It should be noted that, under the condition of not conflicting with each other, the embodiments of the present application and the features in the embodiments can be combined with each other.

[0041]In the following description, many specific details are set forth in order to fully understand the present invention. However, the present invention can als...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for classifying texts on the basis of nave Bayes. The method includes steps of 1, forming feature vectors of the to-be-classified texts by the aid of word segmentation tools, comparing the feature vectors to common words, removing meaningless words in the to-be-classified texts and setting weights w for each word s which appears in the to-be-classified texts; 2, acquiring probability sets Q (w<1>, ..., w<n>) of P (w<1>, ..., w<n>) which appear in training text sets D, and multiplying attributes in the Q (w<1>, ..., w<n>) to obtain prior probability P (w|D) of the P (w<1>, ..., w<n>) which appear in the training text sets D; 3, dividing the quantities of files in the training text sets D by the total number of integral training text sets to obtain prior probability P (D), multiplexing the P (D) by P (x|D) to obtain posterior probability P (D|w) of the P (w<1>, ..., w<n>) in the training text sets D; 4, repeatedly carrying out the steps 2 and 3 and computing all posterior probability; 5, comparing results obtained at the step 4 to obtain the maximum posterior probability P (D). The P (w<1>, ..., w<n>) belong to D categories. The method has the advantages that the method is good in performance and has excellent practical application values in classifying appeal texts of electric power users.

Description

technical field [0001] The invention relates to the field of railway catenary detection, in particular to a text classification method based on naive Bayesian. Background technique [0002] The power customer service department has to face a large number of user appeal information every day. In the traditional mode, the operator classifies the user's appeal information through subjective judgment, and then delivers it to the corresponding department for processing. This method requires manual checking and confirmation one by one, and the informatization and intelligence are seriously insufficient. [0003] The content of text classification of power user appeals is very rich, and these contents are often found in various international conferences and related journals or magazines such as information retrieval, machine learning, knowledge mining and discovery, pattern recognition, smart grid, power science and application, etc. Representative review articles include "Machin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27G06K9/62G06Q50/06
CPCG06F16/35G06Q50/06G06F40/279G06F18/24155G06F18/24323
Inventor 简海英吕磊邓丕杨谦王海袁志刚陈焕章吴红张庆高峰刘悠张威
Owner MEISHAN POWER SUPPLY CO STATE GRID SICHUAN ELECTRIC POWER CO
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products