Data clustering method, device and equipment

A data clustering technology applied in the field of data processing. It addresses the long processing time and large memory consumption of existing clustering methods, avoids repeatedly reading the total data in and out, and reduces hardware cost.

Pending Publication Date: 2022-06-17
BEIJING DATANG TELECOM CONVERGENCE COMMTECH

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a data clustering method, device, and equipment that address two problems of prior-art data clustering methods: long data processing time and high memory consumption.

Examples

Embodiment Construction

[0075] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

[0076] First, some concepts are explained as follows:

[0077] Streaming data

[0078] Streaming data is usually defined as a sequence of tuples made up of continuously arriving elements. It is a continuous, time-varying, unbounded data set with no clear end boundary. Its volume is unbounded, and its value gradually decreases over time.
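As an illustration of what "no clear end boundary" implies for processing, the following minimal Python sketch consumes tuples one at a time and keeps only a count and a running mean in memory. The `sensor_stream` source and the `(timestamp, reading)` tuple layout are hypothetical and are not taken from the patent.

```python
import itertools
from typing import Iterable, Iterator, Tuple


def sensor_stream() -> Iterator[Tuple[int, float]]:
    """Hypothetical unbounded source: yields (timestamp, reading) tuples forever."""
    for t in itertools.count():
        yield t, float(t % 5)


def running_mean(stream: Iterable[Tuple[int, float]]) -> Iterator[float]:
    """Process tuples as they arrive, storing only a count and the current mean."""
    count = 0
    mean = 0.0
    for _, value in stream:
        count += 1
        # Incremental update: no earlier tuple has to be kept or re-read.
        mean += (value - mean) / count
        yield mean


# The stream never ends, so take only the first five results.
first_five = list(itertools.islice(running_mean(sensor_stream()), 5))
print(first_five)
```

Because the state is constant in size, memory use does not grow with the length of the stream, which is the property a static, full-data method lacks.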

[0079] Clustering Algorithm

[0080] Clustering divides a data set into different classes or clusters according to a certain criterion (such as distance), so that the features of data in the same cluster are as similar as possible, while the features of data in different clusters differ as much as possible.
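A distance-based criterion of this kind can be sketched as follows. This is only an illustrative assignment step that places each point in the cluster with the nearest centroid under Euclidean distance; the function name, the sample points, and the fixed centroids are assumptions, not the patent's algorithm.

```python
import math
from typing import List, Sequence

Point = Sequence[float]


def assign_to_nearest(points: List[Point], centroids: List[Point]) -> List[int]:
    """Return, for each point, the index of the closest centroid (Euclidean distance)."""
    labels = []
    for p in points:
        distances = [math.dist(p, c) for c in centroids]
        labels.append(distances.index(min(distances)))
    return labels


# Hypothetical data: two tight groups around (0, 0) and (5, 5).
points = [(0.0, 0.0), (0.2, 0.1), (5.0, 5.0), (5.1, 4.9)]
centroids = [(0.0, 0.0), (5.0, 5.0)]
labels = assign_to_nearest(points, centroids)
print(labels)
```

Points that share a label are close to each other, and points with different labels are far apart, which is the similarity property described above.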

[0081] Cla...

Abstract

The invention provides a data clustering method, device and equipment. The method comprises the following steps: determining a first clustering result of basic data according to the basic data on a network platform; obtaining the incremental data added to the basic data in each stage; determining a second clustering result for the incremental data of each stage; and obtaining a target clustering result of the total data according to the first clustering result and the second clustering result. The total data comprises the basic data, the incremental data added to the basic data at the current stage, and the incremental data added to the basic data before the current stage. With this scheme, the clustering result of the total data can be obtained in real time, the data processing time of the clustering method is shortened, repeated read-in and read-out of the total data is avoided, memory consumption is effectively reduced, and the cost of hardware equipment is reduced.
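The overall flow (cluster the basic data once, cluster each stage's incremental data separately, then combine the two results) can be sketched in Python as below. The abstract does not state how the two results are combined, so the merge rule here is purely an assumption for illustration: each cluster is reduced to a count and a centroid, and an incoming cluster is folded into the nearest existing one when their centroids lie within a hypothetical `radius`, otherwise it is kept as a new cluster. The names `ClusterSummary`, `summarize` and `merge` are likewise hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Sequence, Tuple


@dataclass
class ClusterSummary:
    count: int
    centroid: Tuple[float, ...]


def summarize(points: Sequence[Sequence[float]], labels: Sequence[int]) -> List[ClusterSummary]:
    """Reduce an already-labelled batch to one (count, centroid) summary per cluster."""
    groups: Dict[int, list] = {}
    for p, label in zip(points, labels):
        groups.setdefault(label, []).append(p)
    summaries = []
    for label in sorted(groups):
        members = groups[label]
        dim = len(members[0])
        centroid = tuple(sum(m[d] for m in members) / len(members) for d in range(dim))
        summaries.append(ClusterSummary(len(members), centroid))
    return summaries


def merge(total: List[ClusterSummary], incoming: List[ClusterSummary],
          radius: float) -> List[ClusterSummary]:
    """Assumed merge rule: fold each incoming cluster into the nearest existing one
    if the centroids are within `radius`; otherwise keep it as a new cluster."""
    merged = [ClusterSummary(s.count, s.centroid) for s in total]
    for new in incoming:
        nearest = min(merged, key=lambda s: math.dist(s.centroid, new.centroid), default=None)
        if nearest is not None and math.dist(nearest.centroid, new.centroid) <= radius:
            n = nearest.count + new.count
            # Count-weighted centroid; the raw points are not needed again.
            nearest.centroid = tuple(
                (nearest.count * a + new.count * b) / n
                for a, b in zip(nearest.centroid, new.centroid)
            )
            nearest.count = n
        else:
            merged.append(ClusterSummary(new.count, new.centroid))
    return merged


# First clustering result (basic data) and second clustering result (one stage).
base = summarize([(0, 0), (2, 0), (10, 10)], [0, 0, 1])
stage = summarize([(1, 3), (1, 3), (20, 20)], [0, 0, 1])
target = merge(base, stage, radius=4.0)
print(target)
```

In this sketch only the summaries are carried from stage to stage, so earlier data never has to be read again; that is one plausible way to obtain the memory and time savings the abstract describes.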

Description

Technical field

[0001] The present invention relates to the technical field of data processing, and in particular to a data clustering method, apparatus and device.

Background art

[0002] With the rapid development of technologies such as the Internet of Things and 5G networks, a large amount of continuous, dynamic streaming data is generated, and timely and rapid analysis of the valuable information in the streaming data will bring huge profits. Because of the new characteristics of streaming data, traditional processing methods have drawbacks in both time and resources. First, the traditional static data clustering method consumes a lot of processing time. Since the value of data is inversely proportional to time, a long processing time reduces the value of the data. Second, the traditional method takes the full amount of data as the calculation object, and each execution requires a large memory space, resulting in a waste of computing resources and i...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/62, G06Q30/06
CPC: G06Q30/0631, G06F18/23213
Inventor: 姜晓艳, 李常力, 张铭宇
Owner: BEIJING DATANG TELECOM CONVERGENCE COMMTECH