Dynamic mining method for distributed data streaming

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A data flow and distributed technology, applied in the field of big data processing, can solve the problems of unclear technical details, high cost, and little research, so as to reduce the possibility of data being discarded, reduce data discarding, and reduce network transmission costs Effect

Inactive Publication Date: 2019-04-23

FUJIAN UNIV OF TECH

View PDF1 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The former is simple, but the cost of network transmission is too high for large-scale, fast-flowing online big data

The latter obviously has efficiency advantages, but there are few studies, and naturally many technical details are still unclear

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0037] Such as figure 1 Or shown in 2, the present invention discloses a kind of dynamic mining method of distributed data flow, and it comprises the following steps:

[0038] Step 1, each local node collects the current data block at the current time t, and performs micro-cluster processing on the current data block of each local node;

[0039] Step 2, each local node performs incremental micro-cluster update of the local mode: each local node incrementally updates the current data block processed by the micro-cluster collected at time t and the local mode maintained at time t-1 to form t local patterns of time;

[0040] Step 3, local mode transmission stage: upload the local mode of each local node at time t to the central node;

[0041] Step 4, the central node reconstructs the global sample data set based on micro-clusters after receiving the local patterns of all local nodes at time t:

[0042] Step 5: The central node performs a newly learned basic learner based on th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a dynamic mining method for distributed data streaming, which comprises the following steps of: S1, by each local node, collecting current data blocks at a current t moment andcarrying out micro cluster processing; S2, by each local node, carrying out incremental micro cluster updating of a local mode; S3, executing a local mode transmission stage, i.e., uploading the local mode of each local node at the t moment to a central node; S4, by the central node, after receiving the local modes of all the local nodes at the t moment, reconstructing a global sample data set onthe basis of a micro cluster; and S5, by the central node, executing a basic learner for new learning on the basis of the global sample data set, and carrying out incremental updating of a global mode in a current state on the basic learner for new learning. According to the invention, in a local mining mode, data can be processed locally to the greatest extent, so as to reduce the probability that the data is discarded.

Description

technical field [0001] The invention relates to the technical field of big data processing, in particular to a dynamic mining method for distributed data streams. Background technique [0002] Big data is a demand-driven concept. Although the 4V attributes of big data have been given, they still describe the appearance of big data, so it is still necessary to find a standardized data structure to accurately describe the technical characteristics of big data. Aiming at the technical characteristics of distributed and fluidity hidden in big data, a data structure called distributed data flow can provide an ideal way for the expression of certain types of big data. [0003] Distributed data flow captures the two main technical characteristics of big data, distributed and fluid, and can help solve the regularized analysis of many types of big data. For example: in a large-scale network monitoring system, it may be necessary to set up multiple monitoring sites to collect data du...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): H04L29/08H04L12/24G06K9/62

CPCH04L41/14H04L67/10H04L67/5651G06F18/23213

Inventor毛国君

OwnerFUJIAN UNIV OF TECH

Dynamic mining method for distributed data streaming

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology