Data clustering method, device, system, server and storage medium

A data clustering and server technology, applied in the field of data processing, that addresses two problems: because GPU resources are limited, the data to be clustered cannot be loaded into memory at one time, and ultra-large-scale data clustering therefore cannot be realized.

Active Publication Date: 2022-03-25
BEIJING DAJIA INTERNET INFORMATION TECH CO LTD
Cites: 5, Cited by: 0

AI Technical Summary

Problems solved by technology

[0004] In the above technology, GPU resources that meet the resource requirements can be allocated when clustering ordinary large-scale data. However, when clustering ultra-large-scale data on the order of tens of billions of items, GPU resources are limited, so the server's memory cannot load the data to be clustered at one time, and clustering of ultra-large-scale data cannot be realized.
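The limitation described in [0004] can be illustrated with a rough size estimate. This is a minimal sketch, not taken from the disclosure: the item count, vector dimension, value width, and GPU memory size below are all hypothetical numbers chosen only to show the arithmetic.

```python
def required_bytes(num_items: int, dim: int, bytes_per_value: int = 4) -> int:
    """Raw size of num_items feature vectors of length dim."""
    return num_items * dim * bytes_per_value


def fits(num_items: int, dim: int, capacity_bytes: int) -> bool:
    """True if the whole data set could be loaded into capacity_bytes at once."""
    return required_bytes(num_items, dim) <= capacity_bytes


# Hypothetical figures: 10 billion items, 128-dimensional float32 vectors,
# and a single device with 32 GiB of memory.
NUM_ITEMS = 10_000_000_000
DIM = 128
GPU_BYTES = 32 * 1024**3

if __name__ == "__main__":
    need = required_bytes(NUM_ITEMS, DIM)
    print(f"required: {need / 1e12:.2f} TB, fits on one device: {fits(NUM_ITEMS, DIM, GPU_BYTES)}")
```

Under these assumed figures the raw vectors alone are far larger than one device's memory, which is the situation the disclosure sets out to handle by spreading the work across servers.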

Embodiment Construction

[0117] In order to make those skilled in the art better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings.

[0118] It should be noted that the terms "first", "second" and the like in the description and claims of the present disclosure and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that the data so used may be interchanged under appropriate circumstances such that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods ...

Abstract

The disclosure relates to a data clustering method, device, system, server, and storage medium, and belongs to the technical field of data processing. The method includes: determining the resource requirements of a target data set; determining, in response to the resource requirements, multiple servers and the GPU resources on those servers; and performing, through the GPU resources on the multiple servers, data clustering based on each sub-dataset in the target data set to obtain the data clustering results of the target data set. In the embodiments of the present disclosure, according to the resource requirements of the target data set, the control server clusters parts of the target data set using the available GPU resources on multiple servers, and then obtains the data clustering results of the whole target data set. This realizes distributed data clustering, and thereby data clustering of ultra-large-scale data.
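The flow in the abstract (estimate the requirement, pick servers, cluster each sub-dataset, combine the results) can be sketched as follows. This is an illustrative sketch under stated assumptions, not the method claimed in the disclosure: the `Server` type, the capacity-proportional split, the greedy "leader" clustering used as a stand-in for the per-server GPU step, and the merge by re-clustering local leaders are all hypothetical choices, since the excerpt does not name the algorithm or the merge procedure. Shards are processed sequentially here rather than on real GPUs.

```python
import math
from dataclasses import dataclass


@dataclass
class Server:
    name: str            # hypothetical server identifier
    free_gpu_bytes: int  # hypothetical: GPU memory currently available


def estimate_bytes(num_items, dim, bytes_per_value=4):
    """Resource requirement of the target data set (raw vector size only)."""
    return num_items * dim * bytes_per_value


def select_servers(servers, required):
    """Take servers, largest free capacity first, until the requirement is met."""
    chosen, total = [], 0
    for s in sorted(servers, key=lambda s: s.free_gpu_bytes, reverse=True):
        if total >= required:
            break
        chosen.append(s)
        total += s.free_gpu_bytes
    if total < required:
        raise RuntimeError("not enough GPU resources across all servers")
    return chosen


def split_by_capacity(data, servers):
    """One sub-dataset per server, sized in proportion to its free capacity."""
    total = sum(s.free_gpu_bytes for s in servers)
    shards, start = [], 0
    for i, s in enumerate(servers):
        last = i == len(servers) - 1
        end = len(data) if last else start + len(data) * s.free_gpu_bytes // total
        shards.append(data[start:end])
        start = end
    return shards


def leader_cluster(points, radius):
    """Stand-in local step: a point joins the first leader within radius."""
    leaders, members = [], []
    for p in points:
        for i, c in enumerate(leaders):
            if math.dist(p, c) <= radius:
                members[i].append(p)
                break
        else:
            leaders.append(p)
            members.append([p])
    return leaders, members


def cluster_distributed(data, servers, dim, radius):
    chosen = select_servers(servers, estimate_bytes(len(data), dim))
    shards = split_by_capacity(data, chosen)
    local = [leader_cluster(shard, radius) for shard in shards]
    # Assumed merge step: cluster the local leaders, then pool their members.
    all_leaders = [l for leaders, _ in local for l in leaders]
    all_members = [m for _, members in local for m in members]
    global_leaders, _ = leader_cluster(all_leaders, radius)
    merged = [[] for _ in global_leaders]
    for leader, group in zip(all_leaders, all_members):
        for i, g in enumerate(global_leaders):
            if math.dist(leader, g) <= radius:
                merged[i].extend(group)
                break
    return merged


if __name__ == "__main__":
    points = [(0.0, 0.0), (0.1, 0.0), (10.0, 10.0),
              (10.1, 10.0), (0.0, 0.1), (10.0, 10.1)]
    pool = [Server("a", 32), Server("b", 32), Server("c", 8)]
    print([len(c) for c in cluster_distributed(points, pool, dim=2, radius=1.0)])
```

The point of the sketch is the control flow: no single server ever holds the full data set, only its own sub-dataset plus, at merge time, the much smaller set of local cluster representatives.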

Description

Technical field

[0001] The present disclosure relates to the technical field of data processing, and in particular to a data clustering method, apparatus, system, server, and storage medium.

Background

[0002] The rapid development of computer technology and the mobile Internet has brought about the era of big data. The large-scale data generated in modern society puts enormous pressure on existing data processing methods, and managing and using large-scale data has become an inevitable trend. As the most commonly used data processing method, data clustering has also become very common in large-scale data applications.

[0003] At present, data clustering usually proceeds as follows: acquiring multiple pieces of data to be clustered, estimating the resource requirements of the multiple pieces of data and the remaining GPU (Graphics Processing Unit) resources on the server, from the remaining GPU resources G...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06K9/62, G06F9/50, G06F16/182
CPC: G06F9/5066, G06F16/182, G06F18/23
Inventor: 张胜卓田燕
Owner: BEIJING DAJIA INTERNET INFORMATION TECH CO LTD