Docker container-oriented Spark big data application program performance modeling method and device and storage device
A docker container, application technology, applied in the direction of program control design, multi-programming device, program control device, etc., can solve problems such as the inability to give full play to the advantages of cloud computing systems and the unstable performance of big data applications
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0045] Embodiment one, refer to figure 1 , the present invention is oriented to the implementation of the Spark big data application performance modeling method of Docker container comprising:
[0046] 101. Obtain key parameters that affect the performance of Docker containers and Spark big data applications, and collect corresponding experimental data;
[0047]Obtain key parameters that affect the performance of Docker containers and Spark big data applications, and collect corresponding experimental data. Specifically, the CPU, memory, and I / O resource allocation of Spark big data applications on typical Docker containers can be adjusted jointly. Determine the key parameters of Docker container resource allocation and Spark resource allocation that affect the performance of Spark big data applications on Docker containers, and collect corresponding experimental data based on Spark big data applications. specific:
[0048] In order to study the impact of different resource ...
Embodiment 2
[0064] Embodiment two, refer to figure 2 , the present invention is oriented to the implementation of the Spark big data application performance modeling method of Docker container comprising:
[0065] 201. Obtain the key parameters of the Docker container;
[0066] Get the key parameters of the Docker container. Specifically, by inserting different parameter options into the command to start the Docker container to limit the resource usage of the Docker container, and obtain key parameters that affect performance, mainly CPU, memory and disk-related key parameters.
[0067] 202. Obtain key parameters that affect the execution performance of Spark big data applications;
[0068] In the case of limiting the resource usage of a Docker container, adjust the resource allocation of the Spark big data application, and obtain the key parameters that affect the execution performance of the Spark big data application.
[0069] 203. Collect experimental data;
[0070] When obtainin...
Embodiment 3
[0081] Embodiment 3, the present invention is specifically described through a specific application example below:
[0082] Step 1, parameter adjustment, that is to obtain key parameters that affect the performance of Docker containers and Spark big data applications, and collect corresponding experimental data, specifically:
[0083] It is necessary to deploy a Docker container-based big data cluster, set the value of the Docker container resource allocation parameter, and set the value of the corresponding Spark big data application resource allocation parameter, and conduct an experimental test. The Docker container resource allocation parameters are detailed in Table 1. The resource allocation parameters of the Spark big data application are detailed in Table 2 and will not be described here.
[0084]In this application example, we choose the HDFS file system and Yarn resource manager in the Hadoop ecosystem, as well as the Spark distributed computing framework. Using Doc...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


