Genome data storage method and electronic device

A data storage and genome technology, applied in the field of data processing, can solve problems such as low efficiency and achieve the effect of improving efficiency

Active Publication Date: 2020-08-11
UNITED ELECTRONICS
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] In view of this, the purpose of the present invention is to provide a genome data storage method and electronic equipment, which can solve the problem of low efficiency caused by frequent input and output of a large number of binary files in the genome variation detection process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Genome data storage method and electronic device
  • Genome data storage method and electronic device
  • Genome data storage method and electronic device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0087] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0088] It should be noted that all the expressions using "first" and "second" in the embodiments of the present invention are to distinguish two entities with the same name but different parameters or parameters that are not the same. It can be seen that "first" and "second " is only for the convenience of expression, and should not be understood as a limitation to the embodiments of the present invention, and will not be described one by one in the subsequent embodiments.

[0089] Based on the above purpose, the first aspect of the embodiments of the present invention proposes an embodiment of a genome data storage method, which can solve the problem of low efficiency caused by frequent input and output of a large number ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a genomic-data storage method. The genomic-data storage method includes the steps that in the genome comparison process, gene-sequence comparison information is obtained, and gene-sequence statistical information is established; the gene-sequence comparison information is stored in a magnetic disk, and according to comparison positions of the gene-sequence comparison information in a genome, corresponding indexes are stored in a memory, wherein the indexes are storage positions of the gene-sequence comparison information in the magnetic disk; genome statistical information is classified, and first statistical information and second statistical information are obtained; the first statistical information is stored in the memory, wherein the first statistical information is statistical information with the access frequency higher than the preset frequency in the variation detection process; the second statistical information is stored in the magnetic disk, wherein the second statistical information is statistical information which cannot be stored in the memory and / or statistical information with the access frequency lower than the preset frequency in the variation detection process. The invention also discloses an electronic device with the genomic-data storage method.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a genomic data storage method and electronic equipment. Background technique [0002] The calculation process of genomic variation detection can generally be divided into steps such as comparison, sorting, deduplication, re-alignment, variation detection, and filtering. Among them, the main steps need to use the BAM file (the full name of SAM is sequence alignment map, sequence alignment map. And the BAM file is the file in the binary format of the SAM file (B is taken from binary)) as the output file to write to the hard disk, in the next The step is to read it from the hard disk to the memory, and then proceed to the next step. [0003] In the process of realizing the present invention, the inventor finds that the prior art has the following problems: [0004] In the analysis of human whole genome data, the original data is generally about 100GB, and the main analysis...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G16B20/20G16B30/00G16B40/00G06F3/06
CPCG06F3/061G06F3/0638G06F3/0676G16B99/00
Inventor 蔡文君何光铸王东辉孔令雪
Owner UNITED ELECTRONICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products