Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for item-sets mining

A technology of itemsets and centralization, applied in the field of itemset mining methods and devices, can solve the problems of mining high-weight itemsets, etc., and achieve the effect of narrowing the mining scope and improving efficiency.

Active Publication Date: 2016-10-19
HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL +1
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the problem that high-weight item sets cannot be mined for uncertain data, the embodiment of the present invention provides an item set mining method and device

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for item-sets mining
  • Method and device for item-sets mining
  • Method and device for item-sets mining

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0024] In order to facilitate the description of the embodiments of the present invention, the basic concepts involved in the embodiments of the present invention are introduced in advance as follows:

[0025] 1. Transaction (transaction): refers to a record in the database. For example, when the database records the purchase records of supermarket commodities, each transaction in the database corresponds to the purchase records of the commodities...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and a device for item-sets mining, and belongs to the field of data mining. The method comprises: obtaining a self-defined weight and a lowest expectation weight threshold value [epsilon]; according to occurrence probability and weight of a data item, calculating item weight probability upper limit iubwp of item-sets in an uncertainty database D, mining the item-sets of the iubwp>= |D| x [epsilon] as a high expectation weight upper limit item-set HUBEWI; calculating expectation weight support degree expWSup of each HUBEWI, and mining the HUBEWI of expWSup >= |D| x [epsilon] as a high weight item-set HEWI. Through calculating the item weight probability upper limit of the item-set, the high expectation weight upper limit item-set is obtained, and through calculating the expectation weight support degree of the high expectation weight upper limit item-set, the high weight item-set is obtained, by little calculated amount, the high expectation weight upper limit item-set is mined firstly as a candidate item-set, and mining range of the high weight item-sets is reduced, so as to solve problems that mining high weight item-set can just process accurate data, and a high weight item-set mining technology aimed at the uncertainty database does not exist, and an effect of improving mining efficiency is improved.

Description

technical field [0001] The invention relates to the field of data mining, in particular to an item set mining method and device. Background technique [0002] An uncertainty database (English: uncertain database) usually includes at least one transaction (English: transaction), and each transaction includes at least one data item (English: item). For example, a transaction about weather records includes weather data items such as type, humidity and temperature. Each data item has its own corresponding probability of occurrence. [0003] In an existing data mining method, the user defines the weight of each data item, and then according to the weight of each data item, excavates high-frequency weight item sets from each data item set (itemset) in the precise database. (English: High Frequent Weighted Itemset, referred to as: HFWI). An itemset is a collection of at least one data item, which is used to represent an inherent association rule in a precise database. [0004] ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 林浚玮李勇王巨宏赖晓平甘文生
Owner HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL