A variant detection method based on the cloud computing platform Spark
The cloud computing platform and variant detection technology, applied in the field of bioinformatics, addresses problems such as load imbalance and the inability of the HaplotypeCaller variant detection method to adapt to multi-node environments. It achieves good load balance, fewer data-computation steps, and strong scalability.
Detailed Description of the Embodiments
[0033] The present invention will be further described below in conjunction with specific examples.
[0034] As shown in Figure 1, the variant detection method based on the cloud computing platform Spark provided in this embodiment includes the following steps:
[0035] 1) The Spark master node splits the input Sequence Alignment/Map format file and distributes the pieces to each Spark worker node.
[0036] The input to the method of the invention is a sequence alignment file. The common format for such files is SAM (Sequence Alignment/Map), a text format that records how sequencing reads align to the reference sequence. To save storage space and improve transfer speed, SAM files are usually converted into BAM files by binary compression. A BAM file is a block-based compressed format consisting of a series of data blocks, each no larger than 64 KB. This feature allows efficient r...
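The block structure described above can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the BGZF block layout from the SAM/BAM specification (a gzip member whose extra field carries a "BC" subfield holding the block size minus one), and the function names make_bgzf_block, scan_bgzf_blocks and assign_blocks are hypothetical. The contiguous, near-equal assignment of blocks to workers is likewise an assumption chosen for illustration. A real splitter would also need to locate alignment-record boundaries, because a record may span two blocks.

```python
import struct
import zlib
from typing import List, Tuple


def make_bgzf_block(payload: bytes) -> bytes:
    """Build one BGZF block for demonstration (hypothetical helper)."""
    comp = zlib.compressobj(6, zlib.DEFLATED, -15)  # raw DEFLATE stream
    cdata = comp.compress(payload) + comp.flush()
    bsize = len(cdata) + 25  # total block length minus 1
    # gzip header: ID1, ID2, CM, FLG (FEXTRA), MTIME, XFL, OS, XLEN
    header = struct.pack("<BBBBIBBH", 31, 139, 8, 4, 0, 0, 255, 6)
    # extra subfield: 'B', 'C', SLEN=2, BSIZE
    extra = struct.pack("<BBHH", 66, 67, 2, bsize)
    trailer = struct.pack("<II", zlib.crc32(payload) & 0xFFFFFFFF, len(payload))
    return header + extra + cdata + trailer


def scan_bgzf_blocks(data: bytes) -> List[Tuple[int, int]]:
    """Return (offset, length) for each block, reading headers only."""
    blocks = []
    pos = 0
    while pos < len(data):
        id1, id2, cm, flg, _mtime, _xfl, _os, xlen = struct.unpack_from(
            "<BBBBIBBH", data, pos)
        if (id1, id2, cm) != (31, 139, 8) or not flg & 4:
            raise ValueError("not a BGZF block at offset %d" % pos)
        # Walk the extra subfields until the 'BC' subfield is found.
        p, end, bsize = pos + 12, pos + 12 + xlen, None
        while p + 4 <= end:
            si1, si2, slen = struct.unpack_from("<BBH", data, p)
            if (si1, si2, slen) == (66, 67, 2):
                bsize = struct.unpack_from("<H", data, p + 4)[0]
                break
            p += 4 + slen
        if bsize is None:
            raise ValueError("missing BC subfield at offset %d" % pos)
        blocks.append((pos, bsize + 1))
        pos += bsize + 1  # jump straight to the next block
    return blocks


def assign_blocks(blocks: List[Tuple[int, int]],
                  n_workers: int) -> List[List[Tuple[int, int]]]:
    """Split the block list into contiguous, near-equal groups per worker."""
    base, rem = divmod(len(blocks), n_workers)
    groups, start = [], 0
    for w in range(n_workers):
        size = base + (1 if w < rem else 0)
        groups.append(blocks[start:start + size])
        start += size
    return groups


# Demonstration on five synthetic blocks held in memory.
payloads = [("read-%d\n" % i).encode() * 100 for i in range(5)]
data = b"".join(make_bgzf_block(p) for p in payloads)
blocks = scan_bgzf_blocks(data)
groups = assign_blocks(blocks, 2)

# Any block can be decompressed on its own, without the blocks before it.
off, length = blocks[3]
fourth = zlib.decompress(data[off:off + length], 31)  # 31 = gzip wrapper
```

Because each block is a complete gzip member, a worker that receives only a byte range starting at a block offset can decompress its share without reading the rest of the file, which is what makes the format convenient to distribute.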