Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Hole filling method and device based on third-generation sequencing sequence

A sequence and sequencing technology, applied in the field of bioinformatics, can solve the problems of high resource consumption and slow process of comparing sequencing sequences, and achieve the effect of improving speed and saving consumption

Active Publication Date: 2018-11-06
BEIJING NOVOGENE TECH CO LTD
View PDF10 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Embodiments of the present invention provide a method and device for filling holes based on third-generation sequencing sequences, so as to at least solve the problem that the process of comparing sequencing sequences is very slow in the process of genome hole filling for sequencing data, resulting in high resource consumption. big technical problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hole filling method and device based on third-generation sequencing sequence
  • Hole filling method and device based on third-generation sequencing sequence
  • Hole filling method and device based on third-generation sequencing sequence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present invention will be described clearly and completely with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is a part of the embodiments of the present invention, not all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0031] It should be noted that the terms "first" and "second" in the specification and claims of the present invention and the above-mentioned drawings are used to distinguish similar objects, and not necessarily used to describe a specific sequence or sequence. It should be understood that the data used in this way can be interchanged under ap...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a hole filling method and device based on a third-generation sequencing sequence. The method includes the following steps: comparing at least one sub-sequencing sequence contained in a third-generation sequencing sequence to obtain a first comparison result; extracting a sub-sequencing sequence located within a certain range of at least one gap sequence from the first comparison result to obtain at least one first extraction result; performing fine comparison on the first extraction result to obtain a second comparison result; extracting a sub-sequencing sequence located within a certain range of the at least one gap sequence from the second comparison result to obtain at least one second extraction result; assembling the at least one second extraction result to obtain a consistent sequence; and replacing an original sequence in a genome sketch with the consistent sequence, wherein the gap sequence is an unknown sequence. According to the scheme of the invention, the technical problem that in the process of performing genome hole filling on sequencing data, the process of comparing the sequencing sequence has a low speed, and large resource consumption is caused can be solved.

Description

Technical field [0001] The present invention relates to the field of biological information, in particular to a method and device for filling holes based on third-generation sequencing sequences. Background technique [0002] De novo assembly has become one of the main applications of second-generation sequencing (NGS) technology. Currently, there are a large number of software available for assembling draft genomes, such as Velvet, ABySS, SOAPdeno, etc. Nevertheless, the assembled scaffold will contain many gap sequences, and "N" is generally used to fill in the representative. Generally speaking, because the sequence of the low coverage and repetitive region is too complicated, the software cannot determine the corresponding sequence of the position during the assembly process, so it can only fill in the "N" base to represent the position sequence. Although the distance information of pair-end reads can connect contigs into longer scaffolds, it cannot solve the inherent probl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F19/22
Inventor 周义其李季张锦波蒋智李瑞强
Owner BEIJING NOVOGENE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products