Genome sequence splicing method based on protein information
A technique relating protein sequences to genome sequences, applied in the field of bioinformatics. It addresses problems of existing approaches such as failing to meet sensitivity and accuracy requirements, consuming a lot of time, and reducing the sensitivity of scaffolding results.
Embodiment Construction
[0097] The present invention is further described below with reference to an example. In this example, contigs are represented as DNA sequences.
[0098] As shown in Figure 2, the method provided by the present invention for generating genome splicing sequence scaffolds based on protein information includes the following steps:
[0099] Step 1: Preprocessing.
[0100] S1: Obtain the alignment information between the DNA sequences to be spliced and the protein sequences. The specific procedure is:
[0101] S11: Align the DNA sequences to be spliced against the protein sequences one by one to obtain an alignment file (out.psl).
[0102] The alignment file contains the alignment information for all matched pairs of DNA sequences and protein sequences, and each row in the file represents one alignment record. If there are n matching positions between a DNA sequence and a protein sequence, then n pieces of alignment ...
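The one-row-per-alignment structure described in S11 can be illustrated with a minimal Python sketch. It assumes that out.psl follows the standard 21-column, tab-separated PSL layout written by BLAT, and that the protein is the query and the DNA sequence is the target; the source states neither, so both are assumptions. The names `PslRecord`, `parse_psl_line`, and `group_alignments` are hypothetical helpers for illustration, not part of the described method.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple, Optional, Tuple


class PslRecord(NamedTuple):
    """Subset of the 21 PSL columns (assumed standard BLAT layout)."""
    matches: int
    strand: str
    q_name: str   # assumed here to be the protein sequence
    q_start: int
    q_end: int
    t_name: str   # assumed here to be the DNA sequence (contig)
    t_start: int
    t_end: int


def parse_psl_line(line: str) -> Optional[PslRecord]:
    """Parse one PSL row; return None for header or malformed lines."""
    fields = line.rstrip("\n").split("\t")
    # Data rows have 21 columns and start with an integer match count.
    if len(fields) < 21 or not fields[0].isdigit():
        return None
    return PslRecord(
        matches=int(fields[0]),
        strand=fields[8],
        q_name=fields[9],
        q_start=int(fields[11]),
        q_end=int(fields[12]),
        t_name=fields[13],
        t_start=int(fields[15]),
        t_end=int(fields[16]),
    )


def group_alignments(
    lines: Iterable[str],
) -> Dict[Tuple[str, str], List[PslRecord]]:
    """Group alignment rows by (protein, DNA sequence) pair.

    A pair with n matching positions ends up with n records in its list.
    """
    groups: Dict[Tuple[str, str], List[PslRecord]] = defaultdict(list)
    for line in lines:
        record = parse_psl_line(line)
        if record is not None:
            groups[(record.q_name, record.t_name)].append(record)
    return dict(groups)
```

With a real file, the function could be called as `group_alignments(open("out.psl"))`; the length of each list then gives the number of alignment records for that protein and DNA sequence pair.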