Adaptive sampling model optimization method for data acquisition of large-scale data center

A large-scale data and data acquisition technology, applied in electrical digital data processing, instruments, hardware monitoring, etc., can solve problems such as computational resource consumption, monitoring system difficulties, and upper node response bottlenecks, so as to maintain reconstruction accuracy and reduce acquisition costs. , the effect of reducing the acquisition delay

Pending Publication Date: 2021-10-22
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, none of the above methods can realize data-driven adaptive real-time collection, increase the value density of collected data, and reduce the cost of collection tasks when the fluctuation of operating data is small, but it does not change the difficulty of collecting hundreds of thousands of nodes in large-scale data centers , because when the data fluctuates greatly, the acquisition task is still difficult to achieve real-time performance; without utilizing the inherent characteristics of the operating data, it is easy to generate a response bottleneck at the upper-level node or require a large number of data acquisition centers and processing centers. Cannot meet real-time requirements or consume too much computing resources, which is unbearable for the monitoring system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Adaptive sampling model optimization method for data acquisition of large-scale data center
  • Adaptive sampling model optimization method for data acquisition of large-scale data center
  • Adaptive sampling model optimization method for data acquisition of large-scale data center

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] Embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0051] Such as Figure 1-4 Shown, method of the present invention comprises the following steps:

[0052] Step 1, the formal expression of the application scenario of the present invention is: the total number of isomorphic individuals to be collected is N, the number of collected indicators is K, the collection duration is 0-T, and at each time t, all individuals are sampled at a fixed sampling rate r The data collected at the current moment, the collected data of individual i is expressed as where x 0 ,...,x K are real numbers, and all the data collected at time t are expressed as The unacquired data of individual j is expressed as The sampling decision vector at time t is expressed as Among them, for the individual i who decides to collect, For individual j who decides not to collect, The acquisition cost at time t is de...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an adaptive sampling model optimization method for data acquisition of a large-scale data center. The method comprises the following steps: step 1, collecting data of all limited single individuals in a data set to calculate a reconstruction error Ett so as to accurately evaluate a reconstruction result; step 2, establishing a sampling model, a reconstruction model, an estimation cost function, an error evaluation function and a loss function; step 3, dividing fully-collected data into a training set Dtrain and a test set Dtest which are independent and identically distributed; step 4, jointly training, testing and selecting the sampling model and the reconstruction model; step 5, deploying a sampling reconstruction model in real application scenario. According to the invention, the collection delay can be reduced, meanwhile, a unified collection optimization target is provided for multiple potential applications of operation data, and under the conditions that the collection cost and the reconstruction precision are comprehensively considered and all data are not observed before collection. Through establishing and optimizing the sampling model, the sampling and collecting of the operation data of the data center is adaptively done according to the incomplete historical record.

Description

technical field [0001] The invention belongs to the technical field of data collection, and in particular relates to an adaptive sampling model optimization method for large-scale data center data collection. Background technique [0002] At present, data center operation data is used for multiple data center intelligent management tasks such as energy consumption analysis and management, workflow scheduling, and task scheduling. With the increasing scale of cloud data centers, data-driven data center operation data collection become an important research question. The existing large-scale cloud data center collection methods are divided into two categories: one kind of method reduces the cost of collection by dynamically adjusting the collection strategy or collection frequency, and the other kind of method mainly uses distributed processing mechanism to collect running data. For example, the patent No. CN201310028813.7 discloses a cloud data center information difference ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/30
CPCG06F11/3072G06F11/3093G06F11/3096G06F11/3006
Inventor 韩锐刘驰闫和东
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products