Method and device for compressing and decompressing gene sequences

A gene sequence compression and decompression technology, applied in the fields of computational biology and bioinformatics, that addresses the large storage space occupied by stored gene sequences and the low compression rate of gene sequences, reducing storage space and improving the compression rate.

Active Publication Date: 2020-12-01
SAMSUNG (CHINA) SEMICONDUCTOR CO LTD +1

AI Technical Summary

Problems solved by technology

[0004] An exemplary embodiment of the present invention provides a method and device for compressing and decompressing gene sequences, so as to solve the technical problems in the prior art that the compression rate of gene sequences is low and that storing gene sequences takes up a large amount of storage space.




Detailed Description of Embodiments

[0036] Various example embodiments will now be described more fully with reference to the accompanying drawings, in which some example embodiments are shown.

[0037] Figure 1 is a flowchart showing a method for compressing a gene sequence according to an exemplary embodiment of the present invention.

[0038] Referring to Figure 1, in step S10, a variation reference sequence is generated according to the high-frequency variation information and the standard reference sequence.
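Step S10 can be illustrated with a minimal Python sketch. The excerpt does not say how the high-frequency variation information is encoded, so this sketch assumes it is a list of single-base substitutions given as (position, reference base, alternate base); the function name `build_variation_reference` and the sample data are hypothetical, not taken from the patent.

```python
def build_variation_reference(standard_ref, high_freq_variants):
    """Return a copy of the standard reference with each substitution applied.

    Assumption: each variant is a (position, ref_base, alt_base) tuple with a
    0-based position. Insertions and deletions are not handled in this sketch.
    """
    bases = list(standard_ref)
    for pos, ref_base, alt_base in high_freq_variants:
        # Guard against variant records that do not match the reference.
        if bases[pos] != ref_base:
            raise ValueError(f"reference base mismatch at position {pos}")
        bases[pos] = alt_base
    return "".join(bases)


# Hypothetical toy data: a 10-base reference and two frequent substitutions.
standard_ref = "ACGTACGTAC"
variants = [(2, "G", "T"), (7, "T", "C")]
variation_ref = build_variation_reference(standard_ref, variants)
print(variation_ref)  # ACTTACGCAC
```

The standard reference is left unchanged, so both sequences remain available for later matching.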

[0039] Here, it should be understood that biological genes can be described by the precise arrangement of the base pairs of deoxyribonucleic acid (DNA). That is to say, a biological gene can be represented by an ordered sequence composed of the four bases A (adenine), G (guanine), T (thymine) and C (cytosine), that is, a gene sequence.

[0040] The gene sequences of different organisms have different lengths, and various existing genetic research institutions provide multiple standard refer...


Abstract

The invention provides methods for compressing and decompressing gene sequences and a device for compressing a gene sequence. The method for compressing a gene sequence includes: generating a variation reference sequence according to high-frequency variation information and a standard reference sequence; and compressing the to-be-processed gene sequence according to a result of matching the to-be-processed gene sequence against the variation reference sequence, to obtain the compressed gene sequence. With these methods and this device, the compression rate of the gene sequence can be increased, so that the storage space occupied by the gene sequence is reduced and copying and transmission of the gene sequence are facilitated.
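The idea of compressing against a reference can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented matching procedure, which the excerpt does not describe: the read is assumed to align to the reference at a known offset with substitutions only, and the names `compress_read` and `decompress_read` and the toy sequences are hypothetical.

```python
def compress_read(read, reference, offset):
    """Encode a read as (offset, length, mismatches) relative to a reference.

    Assumption: the read aligns at `offset` with substitutions only.
    """
    window = reference[offset:offset + len(read)]
    if len(window) != len(read):
        raise ValueError("read extends past the end of the reference")
    # Keep only the positions where the read differs from the reference.
    mismatches = [(i, b) for i, (b, r) in enumerate(zip(read, window)) if b != r]
    return (offset, len(read), mismatches)


def decompress_read(record, reference):
    """Rebuild the read from its record and the same reference."""
    offset, length, mismatches = record
    bases = list(reference[offset:offset + length])
    for i, b in mismatches:
        bases[i] = b
    return "".join(bases)


# Hypothetical toy data: the variation reference already contains two
# substitutions that the read also carries.
standard_ref = "ACGTACGTAC"
variation_ref = "ACTTACGCAC"
read = "TTACGC"

rec_std = compress_read(read, standard_ref, 2)
rec_var = compress_read(read, variation_ref, 2)
print(rec_std)  # (2, 6, [(0, 'T'), (5, 'C')])
print(rec_var)  # (2, 6, [])
```

In this toy case the record built against the variation reference carries no mismatches, while the record built against the standard reference carries two, which is the intuition behind the higher compression rate. Decompression needs the same reference that was used for compression.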

Description

Technical field

[0001] The present invention relates to the technical fields of computational biology and bioinformatics, and more specifically to a method and device for compressing and decompressing gene sequences.

Background

[0002] Gene sequences are obtained through collection and sequencing using biological gene sequencing technology. They are the research basis of bioinformatics, genetics, genomics, medicine and many other fields, and have important scientific value and practical significance. With the increasing maturity and widespread use of next-generation high-throughput sequencing technology (Next-generation Sequencing, NGS), the time needed to obtain biological gene sequences has been greatly shortened and the cost significantly reduced. Sequencing projects will become more common in the biomedical field.

[0003] At the same time, the storage capacity required for genetic data is also increasing rapidly. Taking the whole gene sequencing resul...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G16B30/10, G16B35/20, G16B20/30, G16B40/00
Inventor: 石永刚孔鑫令狐雄展郭世硕张周
Owner: SAMSUNG (CHINA) SEMICONDUCTOR CO LTD