Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for processing gene sequence data

A gene sequence and processing method technology, applied in the field of gene sequence data processing, can solve problems such as lack of biological information, and achieve the effect of improving effectiveness and solving the lack of biological information

Active Publication Date: 2012-12-26
BEIJING NOVOGENE TECH CO LTD
View PDF1 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The main purpose of the present invention is to provide a method and device for processing gene sequence data to solve the problem of missing biological information easily caused by the processing method of gene sequence data in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for processing gene sequence data
  • Method and device for processing gene sequence data
  • Method and device for processing gene sequence data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention will be described in detail below with reference to the accompanying drawings and examples.

[0027] An embodiment of the present invention provides a device for processing gene sequence data, and the device for processing gene sequence data provided by the embodiment of the present invention will be introduced below.

[0028] figure 1 is a schematic diagram of a processing device according to an embodiment of the present invention, such as figure 1 As shown, the processing device of this embodiment includes: a receiving unit 10 , a constructing unit 20 , a saving unit 30 , an acquiring unit 40 , a computing unit 50 , a simplifying unit 60 and a cutting unit 70 .

[0029] Specifically, the receiving unit 10 is used to receive the sequencing data of the initial gene sequence; the const...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and a device for processing gene sequence data. The method for processing the gene sequence data comprises the steps of: receiving a sequencing data of an initial gene sequence; building a de Brujin graph of the sequencing data; storing a first edge sequence in the de Brujin graph and each short sequence for forming the first edge sequence; obtaining depth information of each short sequence for forming the first edge sequence; calculating the depth information of the first edge sequence according to the depth information of each short sequence for forming the first edge sequence; and simplifying the de Brujin graph according to the depth information of each edge sequence in the de Brujin graph and each short sequence in the sequencing data, and cutting the simplified de Brujin graph to obtain a contig gene sequence of the sequencing data. By the method and device, the problem of biological information loss easily caused by the method for processing the gene sequence data in the prior art is solved, so as to achieve the effect of improving the assembling availability of the gene sequence.

Description

technical field [0001] The present invention relates to the field of data processing, in particular to a method and device for processing gene sequence data. Background technique [0002] The method of sequencing based on short-segment sequence data is becoming more and more mature. By constructing the de Bruijn diagram (de Bruijn) idea for genome sequence assembly software, a large number of whole gene sequences have been successfully assembled. However, when the existing assembly software performs gene sequence assembly, it does not consider whether the gene sequence used for assembly is a heterozygous gene or a homozygous gene. Taking diploid genes, which are mostly in the biological world, as an example to illustrate, in the prior art When assembling the diploid gene sequence, the diploid gene is assembled as a homozygous diploid, and one of the allele sites is randomly selected to be ignored, that is, discarded In one case of alleles, the diploid gene is treated as a h...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F19/22
Inventor 王垚燊阮航李萌
Owner BEIJING NOVOGENE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products