Text classification method based on naive Bayesian model

A Bayesian model, text classification technology, applied in text database clustering/classification, unstructured text data retrieval, character and pattern recognition, etc., can solve the problems of heavy workload, error-prone, low work efficiency, etc. Achieve the effect of reducing workload, simple algorithm, improving accuracy and efficiency

Inactive Publication Date: 2019-11-12
厦门美域中央信息科技有限公司
View PDF4 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] When relevant departments are working, they need to sort out and classify a large number of texts, which is heavy workload, low work efficiency, and error-prone, so a classification method is urgently needed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method based on naive Bayesian model
  • Text classification method based on naive Bayesian model
  • Text classification method based on naive Bayesian model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in combination with specific embodiments and with reference to the accompanying drawings. It should be understood that these descriptions are exemplary only, and are not intended to limit the scope of the present invention. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concept of the present invention.

[0026] Such as figure 1 As shown, a kind of text classification method based on naive Bayesian model that the present invention proposes, method step comprises:

[0027] S1. Collect sample data, construct training set and verification set;

[0028] S2. Select common information keywords and determine characteristic attributes; use party members' gender, age, working hours, and job positions as characteristic attributes;...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text classification method based on a naive Bayesian model, and the method comprises the steps: collecting sample data, and constructing a training set and a verification set; selecting common information keywords, and determining characteristic attributes; establishing a naive Bayes model; training the naive Bayesian model by using the training set; verifying the training result by using the verification set, and correcting and perfecting; model application. According to the method, the text is classified on the basis of the naive Bayesian model, so that the accuracyand efficiency of classification work are improved, the workload of workers is reduced, and the smooth operation of work is ensured.

Description

technical field [0001] The invention relates to the field, in particular to a text classification method based on a naive Bayesian model. Background technique [0002] The Naive Bayesian classifier originated from classical mathematical theory, has a solid mathematical foundation, and stable classification efficiency. At the same time, the Naive Bayesian model needs to estimate few parameters, is not sensitive to missing data, and the algorithm is relatively simple. Has the smallest error rate compared to other classification methods. [0003] When the relevant departments are working, they need to sort and classify a large amount of texts. The workload is heavy, the work efficiency is low, and it is easy to make mistakes. Therefore, a classification method is urgently needed. [0004] In order to solve the above problems, this application proposes a text classification method based on the naive Bayesian model. Contents of the invention [0005] (1) Purpose of the invent...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06K9/62
CPCG06F16/35G06F18/24155
Inventor 肖清林
Owner 厦门美域中央信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products