Data clustering method, device and system, server and storage medium

A data clustering and server technology, applied in the field of data processing, that addresses the following problem: when memory is limited, ultra-large-scale data cannot be loaded at one time, so clustering of such data cannot be realized.

Active Publication Date: 2020-08-25
BEIJING DAJIA INTERNET INFORMATION TECH CO LTD
6 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] With the above technique, GPU resources that meet the resource requirements can be allocated when clustering ordinary large-scale data. However, when clustering ultra-large-scale data on the order of tens of billions of items, GPU resources are limited, so the server's memory cannot load the data to be clustered at one time, and clustering of the ultra-large-scale data cannot be realized.
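The memory limit described in [0004] can be illustrated with a rough back-of-the-envelope calculation. The sketch below is not taken from the disclosure: the vector dimension (128), the value size (4 bytes, as for float32) and the 16 GiB memory budget are hypothetical figures chosen only to show the order of magnitude, and `fits_in_memory` and `min_chunks` are illustrative helper names.

```python
def fits_in_memory(num_items: int, dim: int, bytes_per_value: int,
                   memory_bytes: int) -> tuple[int, bool]:
    """Return the bytes needed to hold the whole data set and whether it fits."""
    required = num_items * dim * bytes_per_value
    return required, required <= memory_bytes


def min_chunks(required_bytes: int, memory_bytes: int) -> int:
    """Smallest number of equal-budget chunks needed to cover the data (ceiling division)."""
    return -(-required_bytes // memory_bytes)


# Hypothetical figures: ten billion 128-dimensional float32 vectors, 16 GiB of memory.
required, ok = fits_in_memory(10_000_000_000, 128, 4, 16 * 1024**3)
chunks = min_chunks(required, 16 * 1024**3)
print(required, ok, chunks)
```

Under these assumed figures the data set needs several terabytes, far more than a single memory budget of this size, which is why the data has to be processed in parts.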

Examples

Embodiment Construction

[0117] To enable persons of ordinary skill in the art to better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure are described clearly and completely below with reference to the accompanying drawings.

[0118] It should be noted that the terms "first" and "second" in the specification and claims of the present disclosure and in the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that the terms so used are interchangeable under appropriate circumstances, such that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the following exemplary examples do not represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatuses and methods consi...

Abstract

The invention relates to a data clustering method, device and system, a server and a storage medium, and belongs to the technical field of data processing. The method comprises the steps of determining a resource demand of a target data set; in response to the resource demand, determining a plurality of servers and the GPU resources on the plurality of servers; and performing data clustering based on each sub-data set in the target data set through the GPU resources on the plurality of servers to obtain a data clustering result of the target data set. According to the embodiment of the invention, the control server, according to the resource requirements of the target data set, has each part of the data in the target data set clustered through the available GPU resources on the plurality of servers so as to obtain the data clustering result of the target data set. Distributed data clustering can thus be realized, so that data clustering of super-large-scale data is realized.
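The three steps in the abstract (determine the resource demand, pick servers and their GPU resources, cluster each sub-data set and combine the results) can be sketched as follows. This is a minimal single-process illustration under stated assumptions, not the patented implementation: the abstract does not name a clustering algorithm, so a k-means-style update is swapped in here, and `Server`, `free_gpu_bytes`, `BYTES_PER_POINT`, `split_into_sub_datasets`, `partial_assign` and `cluster` are all hypothetical names. In a real deployment each sub-data set would be processed on its own server's GPU rather than in a local loop.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
BYTES_PER_POINT = 8  # assumption: two 4-byte coordinates per point


@dataclass
class Server:
    name: str
    free_gpu_bytes: int  # hypothetical: available GPU memory reported by the server


def resource_demand(num_points: int) -> int:
    """Step 1: estimate the memory needed to hold the target data set."""
    return num_points * BYTES_PER_POINT


def split_into_sub_datasets(data: Sequence[Point],
                            servers: Sequence[Server]) -> List[List[Point]]:
    """Step 2: give each server a slice no larger than its free GPU memory."""
    if sum(s.free_gpu_bytes for s in servers) < resource_demand(len(data)):
        raise RuntimeError("not enough GPU memory across the selected servers")
    shards, start = [], 0
    for s in servers:
        end = min(len(data), start + s.free_gpu_bytes // BYTES_PER_POINT)
        shards.append(list(data[start:end]))
        start = end
    return shards


def partial_assign(shard: Sequence[Point], centroids: Sequence[Point]):
    """Worker side: per-centroid coordinate sums and counts for one sub-data set."""
    k = len(centroids)
    sums, counts = [[0.0, 0.0] for _ in range(k)], [0] * k
    for x, y in shard:
        j = min(range(k), key=lambda i: (x - centroids[i][0]) ** 2
                + (y - centroids[i][1]) ** 2)
        sums[j][0] += x
        sums[j][1] += y
        counts[j] += 1
    return sums, counts


def cluster(data, servers, centroids, iterations=10) -> List[Point]:
    """Step 3: control-server loop that merges the partial results of every shard."""
    shards = split_into_sub_datasets(data, servers)
    centroids = list(centroids)
    for _ in range(iterations):
        k = len(centroids)
        total, count = [[0.0, 0.0] for _ in range(k)], [0] * k
        for shard in shards:  # stands in for one remote call per server
            sums, counts = partial_assign(shard, centroids)
            for j in range(k):
                total[j][0] += sums[j][0]
                total[j][1] += sums[j][1]
                count[j] += counts[j]
        centroids = [(total[j][0] / count[j], total[j][1] / count[j])
                     if count[j] else centroids[j] for j in range(k)]
    return centroids


data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
servers = [Server("a", 32), Server("b", 16)]
result = cluster(data, servers, [(0.0, 0.0), (10.0, 10.0)])
print(result)
```

Because only per-centroid sums and counts travel back to the control loop, no single machine needs to hold the whole data set, which is the property the abstract relies on.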

Description

Technical field

[0001] The present disclosure relates to the technical field of data processing, and in particular to a data clustering method, device, system, server and storage medium.

Background

[0002] The rapid development of computer technology and the mobile Internet has ushered in the era of big data. The large-scale data generated in modern society has put enormous pressure on existing data processing methods, and the management and utilization of large-scale data has become an inevitable trend. As the most commonly used data processing method, data clustering has become very common for large-scale data.

[0003] At present, data clustering is usually performed as follows: obtain a plurality of data to be clustered, estimate the resource requirements of the plurality of data and the remaining GPU (Graphics Processing Unit) resources on the server, and from the remaining GPU resources allocate GPU resources that meet resource requir...

Application Information

IPC(8): G06K9/62, G06F9/50, G06F16/182
CPC: G06F9/5066, G06F16/182, G06F18/23
Inventor 张胜卓田燕
Owner BEIJING DAJIA INTERNET INFORMATION TECH CO LTD