Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Weighted association rule mining method based on data source partition matrix

A rule and database technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems of high algorithm time cost, large amount of data, and time-consuming, etc., and achieve the effect of fast calculation support

Inactive Publication Date: 2016-11-16
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF10 Cites 34 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Usually, the amount of data in the database is relatively large, which will take a lot of time for information reading operations, resulting in an excessively high time cost for the algorithm.
[0026] (2) Due to the large amount of data, a large number of candidate frequent itemsets will be generated when the connection operation is performed during the algorithm calculation process, which will also affect the calculation efficiency
[0027] (3) The traditional Apriori algorithm treats the data items in the transaction database equally, and thinks that each data item plays the same role in association rule mining, and cannot reflect the emphasis of the data item on the association rule conclusion, which will lead to Some "trivial" rules may appear in the results of mining, which has certain limitations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Weighted association rule mining method based on data source partition matrix
  • Weighted association rule mining method based on data source partition matrix
  • Weighted association rule mining method based on data source partition matrix

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0100] In order to better illustrate the purpose and advantages of the present invention, the present invention will be further described below in conjunction with the embodiments and accompanying drawings. In the following specific applications, an example is given to illustrate the method for constructing the knowledge base of disease diagnosis experts in the medical field. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0101] 1. Data preprocessing

[0102] Data acquisition is performed first.

[0103] One way is to use Internet resources. Open the webpage, analyze the attributes of each tag in the source code of the webpage, use regular expressions to search for matches, and find the tags that cont...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a weighted association rule mining method based on a data source partition matrix, and belongs to the technical field of manual intelligence, machine learning and data mining. By using a manual intelligence method, implied valuable expert knowledge can be mined from massive and complex industrial text data and expressed in an association rule manner, and a support can be provided for establishment of a knowledge base in an expert system. According to the method, a classic association rule mining algorithm is analyzed; a method for rapidly computing a support degree based on the data source partition matrix is provided aiming at the problem that an Apriori algorithm is low in efficiency as a data set is scanned repeatedly; and a weighted calibration method based on probability is used for weighting data item sets during an association rule mining process aiming at the problem that the Apriori algorithm treats data items in the transaction data set equally. Experiments verify that the provided method can reduce space complexity and time complexity of traditional association rule mining.

Description

technical field [0001] The invention uses a data mining method to mine association rules for valuable expert knowledge hidden in massive industry text data to obtain expert knowledge represented by association rules; it belongs to the fields of artificial intelligence, machine learning, and data mining. Background technique [0002] Association rules are used to describe a certain relationship between a thing and other things, which mainly manifests as the mutual influence relationship between things. In most cases, the mutual influence relationship between a large number of things cannot be discovered manually, and the computer artificial intelligence technology is used to organize and analyze the data, and express these relationships in the form of rules, which is association rule mining. For example, in a data set, assuming that there is a relationship between two things, it means that the information of one thing can be used to infer the other thing related to it. In co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/24564G06F16/2465
Inventor 孙新王璇严西敏欧阳童王乐和郭文浩
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products