Gene sequence reading method and reading system

A gene sequence reading and analysis technology, applied in the field of bioinformatics. It addresses problems such as high memory consumption, heavy memory demands on a single machine, and the difficulty of finding a suitable error correction algorithm, with the aim of improving efficiency and accuracy while reducing memory consumption.

Inactive Publication Date: 2017-09-08
SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

[0003] However, existing software tools each target specific sequencing platforms and only some of the error types found in sequencing reads. No tool applies to most platforms and handles the various sequence errors well, so it is sometimes difficult to find the right algorithm for error correction. Reading the measured data directly, with its erroneous sequences left in place, consumes a large amount of memory.
[0004] Moreover, gene sequences are currently often read by reading the DNA data file serially, which places high memory demands on a single machine. In addition, although the ABySS algorithm builds the de Bruijn graph in a distributed manner, it does not describe a method for reading the whole file in parallel.
[0005] The prior-art sequence reading methods therefore suffer from sequencing platform limitations and from high memory consumption when DNA data files are stored on a single machine, and they need to be improved.



Embodiment Construction

[0044] To make the objects, technical solutions and advantages of the present invention clearer, embodiments of the present invention are described in detail below with reference to the accompanying drawings. Those of ordinary skill in the art will understand that many technical details are given in each embodiment to help the reader understand the present application. However, the technical solutions claimed in the claims of the present application can be realized even without these technical details, and with various changes and modifications based on the following embodiments.

[0045] Prior-art gene sequence reading methods suffer from sequencing platform limitations and from high memory consumption when DNA data files are stored on a single machine. To reduce that memory consumption, the embodiment of the present invention uses In...


Abstract

The present invention discloses a gene sequence reading method and reading system. The method comprises: determining a suitable error correction algorithm according to relevant information about an original gene sequence, and correcting the original gene sequence with that algorithm to obtain a to-be-processed gene sequence; dividing the to-be-processed gene sequence into blocks according to a preset number of processes and the total size of the to-be-processed gene sequence, to obtain multiple blocked gene sequences; and reading the blocked gene sequences concurrently. Applying the technical scheme of the present invention reduces memory consumption on a single machine and greatly improves the efficiency and accuracy of reading the gene sequence.
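The blocking and concurrent reading steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes one read per line in a plain text file, and it uses threads as a stand-in for the preset number of processes. The names `block_ranges`, `read_block` and `read_concurrently` are hypothetical.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def block_ranges(total_size, num_procs):
    """Split [0, total_size) into num_procs contiguous byte ranges."""
    base, extra = divmod(total_size, num_procs)
    ranges, start = [], 0
    for i in range(num_procs):
        # The first `extra` blocks get one additional byte each.
        end = start + base + (1 if i < extra else 0)
        ranges.append((start, end))
        start = end
    return ranges


def read_block(path, start, end):
    """Return the lines whose first byte lies in [start, end).

    Assumption: one read per line. A line that straddles a block
    boundary belongs to the block in which it starts.
    """
    lines = []
    with open(path, "rb") as f:
        if start > 0:
            # Step back one byte and consume up to the next newline, so the
            # file position lands on the first line starting at or after `start`.
            f.seek(start - 1)
            f.readline()
        while f.tell() < end:
            line = f.readline()
            if not line:
                break
            lines.append(line.rstrip(b"\n").decode("ascii"))
    return lines


def read_concurrently(path, num_procs):
    """Read every block of the file with its own worker."""
    total = os.path.getsize(path)
    ranges = block_ranges(total, num_procs)
    # Threads stand in here for the separate processes described in the text.
    with ThreadPoolExecutor(max_workers=num_procs) as pool:
        return list(pool.map(lambda r: read_block(path, *r), ranges))
```

Each worker holds only its own block, which is the property the abstract relies on to reduce memory use on a single machine. Real FASTA or FASTQ files have multi-line records, so the boundary rule would need to align on record starts rather than on line starts.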

Description

Technical field

[0001] The invention relates to the technical field of bioinformatics, in particular to a gene sequence reading method and a reading system.

Background art

[0002] In current gene sequencing, the sequence errors produced by different sequencing platforms differ somewhat. There are four main types: base substitution errors, base insertion errors, base deletion errors, and ambiguous bases (for example, using N to stand for any of the four possible bases A, C, G, T). For example, errors on the Illumina platform are mainly base substitutions, while errors on Roche 454, Heliscope, Ion Torrent and the Pacific Biosciences RS platform are mainly base insertions and deletions. In terms of their algorithmic basis, three kinds of error correction methods are mainly used at present: 1) methods based on the k-spectrum; 2) methods based on a suffix tree / suffix array; 3)...
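The text does not describe how a k-spectrum method works. The following Python sketch shows only the general idea behind that family: count every length-k substring (k-mer) across the reads and treat rarely seen k-mers as likely to contain an error. The function names, the `min_count` threshold and the handling of N are assumptions made for illustration, not details taken from the patent.

```python
from collections import Counter


def kmer_spectrum(reads, k):
    """Count every k-mer across all reads, skipping k-mers with ambiguous base N."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if "N" in kmer:
                continue  # assumption: ambiguous bases are left out of the spectrum
            counts[kmer] += 1
    return counts


def weak_kmers(counts, min_count):
    """Return the k-mers seen fewer than min_count times (candidate errors)."""
    return {kmer for kmer, c in counts.items() if c < min_count}


reads = ["ACGTAC", "ACGTAC", "ACGTTC"]  # the third read has one substituted base
spectrum = kmer_spectrum(reads, k=4)
suspects = weak_kmers(spectrum, min_count=2)
```

In this toy input the two k-mers that cover the substituted base occur only once, so they are the ones flagged. A practical corrector would then search for a small edit that turns each weak k-mer into a frequent one.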


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F19/22
CPC: G16B30/00
Inventor: 滕彦宁, 魏彦杰, 孟金涛, 郭宁, 葛健秋
Owner: SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI