Unlock instant, AI-driven research and patent intelligence for your innovation.

Data analysis method and system based on weighted positive and negative sequence pattern

A data analysis system, sequential mode technology, applied in manufacturing computing systems, electrical digital data processing, special data processing applications, etc.

Active Publication Date: 2019-03-15
南方电网互联网服务有限公司
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The technical task of the present invention is to provide a data analysis method and system based on weighted positive and negative sequence patterns to solve how to provide a data with simple mining process, short time consumption, and the ability to assign weights to item sets according to their importance The problem with analytical methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data analysis method and system based on weighted positive and negative sequence pattern
  • Data analysis method and system based on weighted positive and negative sequence pattern
  • Data analysis method and system based on weighted positive and negative sequence pattern

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0063] The data analysis method based on weighted positive and negative sequence patterns of the present invention is used to mine data, including the following settings:

[0064] 1. Define the minimum support of the negative sequence

[0065] MIS(i) represents the minimum item support of item i, where i is a positive or negative item;

[0066] A positive element, that is, an event has occurred, and its minimum support is the minimum support value of item i in the element;

[0067] Negative elements, that is, no events, use the information of related positive elements to calculate their minimum support: MIS(-i)=1-MIS(i); for negative sequences Where a, b represent itemset elements, the minimum support of the negative sequence is:

[0068]

[0069] In the negative sequence Y, the item set element is e 1 ,e 2 ,...e r , the minimum support of the negative sequence Y is the minimum support value of the elements in the sequence, where the minimum support of Y is:

[0070]...

Embodiment 2

[0147] The data analysis system based on weighted positive and negative sequence patterns of the present invention is used to implement the data analysis method based on weighted positive and negative sequence patterns disclosed in Embodiment 1, including:

[0148] The positive sequence mining module is used to mine weighted frequent positive sequence patterns based on the connection operation and pruning operation in the GSP algorithm on the condition that the weighted support degree is greater than the set minimum support degree;

[0149] The candidate negative sequence mining module is used to generate a corresponding negative sequence seed set based on the 1-length positive sequence pattern in the weighted frequent positive sequence pattern, and generate a candidate negative sequence pattern by connecting;

[0150] The negative sequence mining module is used to mine weighted frequent negative sequence patterns by taking the weighted support degree greater than the set minim...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data analysis method and a data analysis system based on weighted positive and negative sequence patterns, belonging to the field of computer data mining, which solves the problem that how to provide a data analysis method which is simple in mining process, short in time consumption and capable of realizing the distribution of weight values according to the importance ofan itemset. The method includes mining weighted frequent positive sequence patterns based on join operation and pruning operation in GSP algorithm under the condition that weighted support degree is greater than the set minimum support degree. Based on 1-length positive sequence patterns, generating corresponding negative sequence seed sets, and generating candidate negative sequence patterns by linking; The weighted frequent negative sequence patterns are mined under the pruning condition that the weighted support degree is greater than the set minimum support degree. The system includes positive sequence mining module, candidate negative sequence mining module and negative sequence mining module.

Description

technical field [0001] The invention relates to the field of computer data mining, in particular to a data analysis method and system based on weighted positive and negative sequence patterns. Background technique [0002] In the process of sequential pattern mining, sometimes it is not only necessary to count the relationship between item sets when they appear, but also to count the relationship between missing item sets and appearing item sets, thus producing negative sequence pattern mining technology. Negative sequence pattern mining technology can dig deeper into sequence databases that are easily overlooked but very valuable information. [0003] Existing data mining algorithms include: PNSP algorithm, Neg-GSP algorithm, MMSWNeg-GSP algorithm, among them, PNSP algorithm can dig out positive and negative sequence patterns, which is based on the GSP algorithm to generate sequence patterns by connecting positive and negative itemsets. Finally, an algorithm that generates...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/2458
CPCY02P90/30
Inventor 姜合王东远潘品臣吕奕锟
Owner 南方电网互联网服务有限公司