Heterogeneous Hadoop cluster-based task scheduling method
A technology for Hadoop clustering and task scheduling, applied in the field of big data, it can solve problems such as the inability to meet performance requirements, and achieve the effect of improving the utilization of cluster resources and speeding up the completion time.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0071]Two different types of physical hosts are used to form a heterogeneous Hadoop cluster. One type of physical host has a 4-core CPU (model is I7-4790), the main frequency is 3.6GHz, and the memory is 16GB. Another type of physical host is also a 4-core CPU (model is Intel Xeom E3-1231v3), the main frequency is 3.4GHz and the memory is 16GB. The Hadoop cluster consists of 6 virtual machine nodes, and these 6 virtual machines are distributed on two different types of hosts. In the Hadoop cluster, because the cluster size is relatively small, the data in HDFS is set from 3 backups to 2 backups. The HDFS data block size is set to 64MB. The virtual machine uses VMware workstation12.0, and the Ubuntu14.04 version installed in the operating system. The cluster is installed with Hadoop2.4.1 version. The specific configuration of the cluster is shown in Table 1.
[0072] Table 1 Hadoop cluster configuration
[0073]
[0074] In this embodiment, a comparative experiment is ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com