Supercharge Your Innovation With Domain-Expert AI Agents!

Data mining method, device and equipment based on Bayesian classification algorithm

A Bayesian classification and data mining technology, applied in data mining, special data processing applications, unstructured text data retrieval, etc., can solve problems that affect the acceptability of results, poor interpretability of output results, and low confidence , to achieve the effect of giving full play to the potential value of data and quickly and efficiently mining and displaying

Pending Publication Date: 2022-06-24
SHENZHEN ZHENGTONG ELECTRONICS +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The current data mining algorithms include methods such as Bayesian, decision tree, neural network, and association analysis. Decision tree is a basic classification and regression method. In classification problems, it represents the process of classifying instances based on features. Generally, They are all generated from top to bottom, representing a collection of if-then rules. Its advantage is that it is intuitive and generates understandable rules. However, as the complexity of data increases, its branch trees will also increase. Management difficulty
Moreover, it is difficult to discover rules based on multiple variable combinations, and it is difficult to predict continuous fields
The neural network can process continuous signals, has good self-adaptation, self-organization and strong learning functions, and has an information processing mode closer to the human brain. It has many applications in the fields of natural language understanding, image recognition and speech recognition, but Its hidden layer is a black box to the outside world, the factors for drawing conclusions are not obvious, and the interpretability of the output results is poor, which affects people's acceptance of the results
The advantage of the correlation analysis method is that it can produce clear results, and the processing process can be seen, but the data with low confidence in this method may also reflect very important market information, which will be ignored

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data mining method, device and equipment based on Bayesian classification algorithm
  • Data mining method, device and equipment based on Bayesian classification algorithm
  • Data mining method, device and equipment based on Bayesian classification algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0059] Please refer to figure 1 , an embodiment of the present invention provides a data mining method based on a Bayesian classification algorithm, the method comprising:

[0060] S100, acquiring text information.

[0061] In this embodiment of the present invention, the text information that needs to be mined is first obtained. For example, get product reviews of a brand from the Internet as input.

[0062] S200: Preprocess the text information to obtain target text information.

[0063] After obtaining the text information that needs data mining, because the text information contains some information other than words such as expressions, special symbols, punctuation marks, etc., these information will interfere with our data mining and have no use value. Therefore, before data mining, it is necessary to preprocess the text information to obtain the target text information.

[0064] For details, please refer to figure 2 , step S200 includes:

[0065] S210, remove punc...

Embodiment 2

[0122] Please refer to Figure 7 , an embodiment of the present invention provides a device for building a knowledge graph based on unsupervised syntactic analysis, the device includes: a text acquisition module 100 , a text processing module 200 , a text conversion module 300 , a text classification module 400 and a statistics module 500 .

[0123] a text acquisition module 100 for acquiring text information;

[0124] A text processing module 200, configured to preprocess the text information to obtain target text information;

[0125] A text conversion module 300, configured to process the target text information to obtain a text vector of the target text information;

[0126] A text classification module 400, configured to input the text vector into a Bayesian classification model to obtain a category of the text information;

[0127] The statistics module 500 is configured to count the high-frequency words of each category in the category to obtain a set of high-frequenc...

Embodiment 3

[0135] An embodiment of the present invention provides a computer-readable storage medium, on which computer instructions are stored, and when the instructions are executed by a processor, implement the steps of the method in the first embodiment. Information is also stored on the storage medium. Wherein, the storage medium may be a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a flash memory (Flash Memory), a hard disk (Hard Disk Drive, Abbreviation: HDD) or solid-state drive (Solid-State Drive, SSD), etc.; the storage medium may also include a combination of the above-mentioned types of memories.

[0136] Those skilled in the art can understand that all or part of the processes in the methods of the above embodiments can be completed by instructing relevant hardware through a computer program, and the program can be stored in a computer-readable storage medium, and when the program is executed ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a data mining method, device and equipment for a Bayesian classification algorithm, and the method comprises the steps: obtaining text information, carrying out the preprocessing of the text information, obtaining target text information, processing the target text information, obtaining a text vector of the target text information, inputting the text vector into a Bayesian classification model, and obtaining a Bayesian classification model. Obtaining a category of the text information, and counting high-frequency words of each category in the category to obtain a high-frequency word set of each category; according to the method, the problems that text information in the current internet is redundant, unclassified, too rough and lack of guidance value in the prior art are solved, the data are quickly and efficiently mined and displayed, and the potential value of the data is exerted.

Description

technical field [0001] The invention relates to the technical field of data mining, in particular to a data mining method, device and equipment based on a Bayesian classification algorithm. Background technique [0002] Data mining involves many subject areas and methods, and the main task is to mine useful information from massive information or data. With the development of information technology and smart devices, more and more text data are generated in cyberspace, and the problem of text information overload is increasing. At present, we can easily and quickly obtain a large amount of information, the frequency of obtaining information has increased, and the difficulty of mining key information has also increased. Especially in the e-commerce applications that have emerged with the development of the times, online shopping users can complete the purchase of goods without leaving home, but it is usually difficult to distinguish the quality of the goods. At this time, us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06K9/62
CPCG06F16/35G06F2216/03G06F18/24155
Inventor 唐卓罗文明曹嵘晖纪军刚尹旦宋柏森朱纯霞赵环
Owner SHENZHEN ZHENGTONG ELECTRONICS
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More