Method and apparatus for mine frequent episodes of multi-source data stream

A technology of frequent episodes and data flow, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as mining

Active Publication Date: 2018-12-18
NORTHWESTERN POLYTECHNICAL UNIV
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present invention provides a method and device for mining frequent episodes of multi-source data streams, which are used to solve the problem ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for mine frequent episodes of multi-source data stream
  • Method and apparatus for mine frequent episodes of multi-source data stream
  • Method and apparatus for mine frequent episodes of multi-source data stream

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0105] Figure 4 A schematic diagram of a data flow structure with a "sequential relationship" provided by Embodiment 1 of the present invention; Figure 5 Provide and Figure 4 Schematic diagram of the corresponding data grid structure.

[0106] Such as Figure 4 As shown, there is W in the figure (1) , W (2) Two data streams, where W (1) :ADCADCBC...,W (2) :BECABEAA…. W (1) , W (2) The sequence relationship of the data items included in the two data streams is as follows: Figure 4 shown. Figure 5 In , use '·' to indicate that there is no sequence relationship between the corresponding two events, and use '×' to indicate that there is a sequence relationship between the corresponding two events.

[0107] For data flow W (1) and W (2) , can be based on Figure 5 Perform frequent episode mining. Specifically include:

[0108] Step 501, traverse the entire data grid to find the first multi-sequence data grid M 31 ;

[0109] Step 502, determine the composition...

Embodiment 2

[0118] Figure 6 It is a schematic diagram of a method for mining frequent episodes of multi-source data streams provided by Embodiment 2 of the present invention, as shown in Figure 6 As shown, the method includes the following steps:

[0119] Step 601, combining two sets of data streams without sequence relationship into a data grid;

[0120] Step 602, traversing the data grids without sequence relationship, if it is confirmed that the data grid contains single-sequence data grids, then perform step 603, if it is confirmed that the data grid contains multi-sequence data grids, then perform step 604;

[0121]Step 603, since the single-sequence data grid can generate a unique sequence, confirm the first data item or the second data item that constitutes the first single-sequence data grid from the data flow without sequence relationship, according to the first data item or the second data item Two data items, directly generate a sequence of single-sequence data grids;

[0...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a device for mining frequent episodes of multi-source data streams, which relates to the technical field of data mining. In order to solve the problem that the mining object of the existing data stream mining technology is a single data stream, there is a problem that the mining cannot be aimed at multi-level, multi-angle, multi-direction mining. The method comprises the following steps: starting from the initial point position of the data grid, traversing the data stream including the data points with no sequential relationship in the data grid; accordingto the feature that the data streams corresponding to different processes occur at intervals, the data streams with no sequential relationship being confirmed as the data grid sequence; the mixed frequent episodes in the data lattice sequence being confirmed by a frequent episodes mining method.

Description

technical field [0001] The present invention relates to the technical field of data mining, and more specifically relates to a method and device for mining frequent plots of multi-source data streams. Background technique [0002] At present, data mining has been widely used in intelligent transportation systems. It is established by effectively integrating advanced information technology, data communication transmission technology, electronic sensing technology, control technology and computer technology into the entire ground traffic management system. It is a real-time, accurate and efficient comprehensive transportation management system that works in a wide range and in all directions. The application of frequent episode mining of multi-source data streams in the flow control of intelligent traffic sections is mainly based on the passing road information data collected by traffic sensors on different vehicles, combined with the frequent episode mining method of multi-so...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 尤涛杜承烈陈进朝李亚敏李宇博
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products