Unlock instant, AI-driven research and patent intelligence for your innovation.

Self-loop Bidirectional Edge Recognition and Removal Method for Bidirectional Multi-step de Bruijn Graph

A bidirectional, multi-step, self-looping technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of low sequencing abundance, uninvolved identification and removal methods, etc., to increase the length and avoid damage. , the effect of improving quality

Active Publication Date: 2017-02-01
SHENZHEN INST OF ADVANCED TECH
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0014] However, the above strategies for error correction and error removal are all based on the premise of uniform abundance, but in actual sequencing, some vertices are mistaken for error vertices due to the relatively low sequencing abundance, while SNPs or mutations in repetitive sequences are not. Possibly due to high deabundance, considered correct apex
In addition, the above-mentioned strategies are all for classifying and correcting the vertices of the De Bruijn graph, but for the identification and removal of self-loop bidirectional edges in the bidirectional multi-step De Bruijn graph, but none of them are involved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self-loop Bidirectional Edge Recognition and Removal Method for Bidirectional Multi-step de Bruijn Graph
  • Self-loop Bidirectional Edge Recognition and Removal Method for Bidirectional Multi-step de Bruijn Graph

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.

[0049] The invention provides a self-loop bidirectional edge recognition and removal method of a bidirectional multi-step De Bruijn graph, comprising steps,

[0050] S1. Read the sequencing data source file, and construct a bidirectional multi-step De Bruijn diagram;

[0051] S2, setting the data structure of each vertex u in the bidirectional multi-step De Bruijn graph, and identifying the self-loop bidirectional edge of the bidirectional multi-step De Bruijn graph;

[0052] If the out-degree of the vertex u minus the number deg of the self-loop edges of the vertex u is equal to 1 or 2, and one of the subscripts of the two non-self-loop edges is less than 4 and the other is greater than 3, then mark the Vertex u is an extensible self-loop vertex.

[0053] S3, removing the self-loop bidirectional edges of the bidirectional multi-ste...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for identifying and removing self-loop bidirectional edges of a bidirectional multistep De Bruijn graph, which comprises the following steps of: S1, reading a sequencing data source file and constructing the bidirectional multistep De Bruijn graph; S2, setting a data structure of each vertex u in the bidirectional multistep De Bruijn graph and identifying the self-loop bidirectional edges of the bidirectional multistep De Bruijn graph; and S3, removing the self-loop bidirectional edges of the bidirectional multistep De Bruijn graph. The method has the technical effects that (1) the method faces the bidirectional multistep De Bruijn graph; (2) according to target nodes of the self-loop bidirectional edges and target node directions of the self-loop bidirectional edges, the self-loop bidirectional edges are identified and distinguished; (3) according to the target node directions of the self-loop bidirectional edges, different self-loop bidirectional edge removal methods are adopted; and (4) error bidirectional edges can be effectively found and damage to correct bidirectional edges is avoided, so that the length of contigs is prolonged and more importantly, quality of contigs can be synchronously improved.

Description

technical field [0001] The invention relates to the field of gene sequencing, in particular to a self-loop bidirectional edge recognition and removal method of a bidirectional multi-step De Bruijn graph. Background technique [0002] Gene sequence analysis takes algorithms and mathematical models as the core, including: storage and acquisition of genetic data, sequence comparison, sequencing and splicing, gene prediction, biological evolution and phylogenetic analysis, protein structure prediction, RNA structure prediction, molecular design and drugs Design, metabolic network analysis, gene chip, DNA computing, etc. The close combination of biotechnology and computer information processing technology has accelerated the speed of processing biological information data, making accurate interpretation of biology in the shortest possible time and accelerating the development of bioinformatics. [0003] Gene sequence analysis is the analysis of massive gene sequence data to extr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F19/20
Inventor 孟金涛张慧琳彭丰斌魏彦杰冯圣中
Owner SHENZHEN INST OF ADVANCED TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More