A Text Classification Method Based on Naive Bayes

A text classification technology, applied in text database clustering/classification, unstructured text data retrieval, instruments, etc. It addresses the unsatisfactory results of existing text classification algorithms and achieves good performance and good practical application value.

Active Publication Date: 2020-12-01
MEISHAN POWER SUPPLY CO STATE GRID SICHUAN ELECTRIC POWER CO
3 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a text classification method based on naive Bayes, which solves the technical problem that existing text classification algorithms give unsatisfactory results. By addressing the shortcoming of the independence assumption, the method performs better, and it has good practical application value for classifying power user appeal texts.

Examples

Embodiment Construction

[0039] The present invention provides a text classification method based on naive Bayes, which solves the technical problem that existing text classification algorithms give unsatisfactory results. By addressing the shortcoming of the independence assumption, the method performs better, and it has good practical application value for classifying power user appeal texts.

[0040] In order to understand the above-mentioned purpose, features and advantages of the present invention more clearly, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments. It should be noted that, under the condition of not conflicting with each other, the embodiments of the present application and the features in the embodiments can be combined with each other.

[0041] In the following description, many specific details are set forth in order to fully understand the present invention. However, the present invention can als...

Abstract

The invention discloses a text classification method based on naive Bayes, comprising: Step 1: using a word segmentation tool to form a feature vector of the text to be classified, comparing the feature vector with a list of common words, and removing meaningless words from the text to be classified; setting a weight w_i for each word s_i that appears in the text to be classified; obtaining the set Q(w_1, ..., w_n) of probabilities of P(w_1, ..., w_n) in the training text set D_i; multiplying the attributes of Q(w_1, ..., w_n) to obtain the conditional probability P(w|D_i) of P(w_1, ..., w_n) in the training text set D_i; Step 3: dividing the number of files in the training text set D_i by the total number of training texts in the entire training text set to obtain the prior probability P(D_i), and computing P(D_i)*P(w|D_i) to obtain the posterior probability P(D_i|w) of P(w_1, ..., w_n) in the training text set D_i; Step 4: repeating steps 2 and 3 to calculate all posterior probabilities; Step 5: comparing the results of step 4 and taking the largest posterior probability P(D_i|w); the corresponding class D_i is the category to which P(w_1, ..., w_n) belongs. The method performs well and has good practical application value in the text classification of power user appeals.
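The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several details are assumptions because the abstract does not specify them: whitespace splitting stands in for the word segmentation tool, `STOP_WORDS` is a hypothetical common-word list, Laplace smoothing is added so unseen words do not zero out the product, the product is computed in log space, and each weight w_i is applied as an exponent on the word's conditional probability. The function names `tokenize`, `train`, and `classify` and the sample data are illustrative only.

```python
import math
from collections import Counter, defaultdict

# Hypothetical common-word list (assumption; the source names no list).
STOP_WORDS = {"the", "a", "is", "my", "to", "of"}


def tokenize(text):
    """Split on whitespace and drop common words (stand-in for a segmenter)."""
    return [t for t in text.lower().split() if t not in STOP_WORDS]


def train(labeled_texts):
    """Count documents and word occurrences for each class D_i."""
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in labeled_texts:
        doc_counts[label] += 1
        for word in tokenize(text):
            word_counts[label][word] += 1
            vocab.add(word)
    return doc_counts, word_counts, vocab


def classify(text, model, weights=None):
    """Return the class with the largest (log) posterior score, plus all scores."""
    doc_counts, word_counts, vocab = model
    weights = weights or {}
    total_docs = sum(doc_counts.values())
    words = tokenize(text)
    scores = {}
    for label in doc_counts:
        # Prior P(D_i): documents in the class over all training documents.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in words:
            # Laplace-smoothed P(word | D_i) (smoothing is an assumption).
            p = (word_counts[label][word] + 1) / (total_words + len(vocab))
            # Weight w_i used as an exponent, i.e. a multiplier in log space
            # (assumption; the abstract does not say how weights are applied).
            score += weights.get(word, 1.0) * math.log(p)
        scores[label] = score
    return max(scores, key=scores.get), scores


# Made-up training data for illustration only.
training = [
    ("power outage in my street since morning", "outage"),
    ("outage again no power at home", "outage"),
    ("bill amount is wrong this month", "billing"),
    ("wrong charge on electricity bill", "billing"),
]
model = train(training)
label, scores = classify("no power outage at home", model)
print(label)
```

Working in log space avoids numerical underflow when many small probabilities are multiplied, and it leaves the ranking of classes unchanged because the logarithm is monotonic.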

Description

Technical field

[0001] The invention relates to the field of railway catenary detection, in particular to a text classification method based on naive Bayes.

Background technique

[0002] The power customer service department has to handle a large volume of user appeal information every day. In the traditional mode, the operator classifies each user's appeal through subjective judgment and then delivers it to the corresponding department for processing. This method requires manual checking and confirmation one by one, and its level of informatization and intelligence is seriously insufficient.

[0003] Research on text classification of power user appeals is extensive, and it appears frequently in international conferences and related journals or magazines on information retrieval, machine learning, knowledge mining and discovery, pattern recognition, smart grid, power science and application, etc. Representative review articles include "Machin...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/35, G06F40/279, G06K9/62, G06Q50/06
CPC: G06F16/35, G06Q50/06, G06F40/279, G06F18/24155, G06F18/24323
Inventor: 简海英吕磊邓丕杨谦王海袁志刚陈焕章吴红张庆高峰刘悠张威
Owner: MEISHAN POWER SUPPLY CO STATE GRID SICHUAN ELECTRIC POWER CO