Fault recovery on a massively parallel computer system to handle node failures without ending an executing job
A computer system, computing node technology, applied in the direction of non-redundant fault handling, computers, digital computer components, etc., can solve the increase of soft and hard faults, restart, parallel computer system inefficient hardware utilization of computer shutdown issues of time
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0016] The present invention relates to an apparatus and method for recovering from a soft failure on a node of a parallel computer system without ending a job being executed on a node partition including the failed node. The preferred embodiment will be described in terms of the Blue Gene / L massively parallel computer developed by International Business Machines Corporation (IBM).
[0017] figure 1 A block diagram representing a massively parallel computer system 100 such as the BlueGene / L computer system is shown. The BlueGene / L system is a scalable system, where the maximum number of computing nodes is 65,536. Each node has an Application Specific Integrated Circuit (ASIC) 112 , also referred to as a bluegene / L computing chip 112 . The computing chip incorporates two processors or central processing units (CPUs) and is mounted on node daughter card 114 . The nodes typically have 512 megabytes of local memory. The node board 120 accommodates 32 node daughter cards 114 ea...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com