Uncertain data model oriented utility item set mining method

A technology for determining data and uncertainty, applied in special data processing applications, electrical digital data processing, instruments, etc., can solve problems such as long running time, different numbers, multiple resources, etc., and achieve the goal of reducing running time and saving resources The effect of consumption

Inactive Publication Date: 2016-05-25
YILAN YUNLIAN TECH CO LTD
View PDF2 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. The actual number of various commodities included in the packaged commodities on the platform may not be the same;
[0005] 2. The profits brought by the various commodities actually included in the packaged commodities on the platform are different;
[0024] But when the number of items in the data set increases, for example, when there are 40 items, theoretically there will be 2 itemsets 40 -1 piece, about 1.1×10 12 , although the actual number of itemsets will not reach this number, but the number will still be very large, querying and verifying each itemset separately will occupy and con

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Uncertain data model oriented utility item set mining method
  • Uncertain data model oriented utility item set mining method
  • Uncertain data model oriented utility item set mining method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] In order to better explain the present invention and facilitate understanding, the present invention will be described in detail below through theoretical analysis of the present invention and specific embodiments in conjunction with the accompanying drawings.

[0051] theoretical analysis

[0052] There are two necessary conditions to measure whether an itemset is a high-utility itemset based on uncertainty: utility and expected support reach their respective thresholds.

[0053] Accordingly, the design idea of ​​the present invention is:

[0054] Optimization strategy 1:

[0055] In a data set, the number of occurrences of an itemset must not be less than the number of occurrences of any of its supersets.

[0056] Proof: If k+1 itemset X k+1 is the k-itemset X k A superset of , when (X k+1 at T j appears in ), it must exist (X k at T j appears in ).

[0057] For the expected support, if the expected support of an itemset does not reach the threshold, any ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an uncertain data model oriented utility item set mining method. The method comprises the steps of S1, verifying transaction weighting utility and expected support degree of each item set and forming a total candidate set by taking item sets passing the verification as candidate sets; and S2, verifying the utility of each item set in the total candidate set through an uncertain data model to obtain an uncertainty based high-utility item set, wherein the transaction weighting utility is equal to the sum of transaction utilities of all transactions containing the item sets in the uncertain data model, the item sets forming the total candidate set include a k-item set verified through the step S1, k is 1-n, n is an item number of the transaction with the most items in the uncertain data model, and when k is greater than 1, the k-item set is obtained by using a k-1-item set in the total candidate set as a subset and performing verification through the step S1. According to the method, an optimization method is added in an existing mining method, so that the technical effects of reducing resource consumption and shortening running time are achieved.

Description

technical field [0001] The invention relates to a data mining technology for effectively extracting useful information from a large amount of data, in particular to a utility item set mining method for uncertain data models, which can be used for e-commerce platforms Mining of Utility Itemsets in Uncertain Data Model of Packaged Goods Sales. Background technique [0002] The emergence of data mining technology enables people to effectively extract useful information from a large amount of data. The widespread package sales model on the e-commerce platform (many types of commodities represented by toiletries and cosmetics) has gradually drawn attention to the utility itemset mining model and method in uncertain data. By discovering the actual correlation between products and on the premise that the profit reaches a certain standard, the correct package sales model can be formulated. At the same time, we must pay attention to the user's feedback, which is the criterion for w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/2453
Inventor 兰雨晴王洋
Owner YILAN YUNLIAN TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products