Species taxonomy method based on diversity comparisons of distances among amino acids in protein sequences

A protein sequence and amino acid technology, applied in sequence analysis, proteomics, genomics, etc., can solve the problems of not considering gaps, increased calculation amount, large error, etc., and achieve the effect of simple calculation method

Active Publication Date: 2016-04-20
WEIFANG MEDICAL UNIV
View PDF2 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When comparing two sequences, the simplest case is to ignore gaps and only select the starting point for comparison, but this method has large errors and is difficult to reflect the real situation
At present, the most commonly used method is alignment alignment, that is, to align sequences of different lengths by inserting intervals, but there are many ways to insert intervals, which leads to the complexity of alignment and greatly increases the amount of calculation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Species taxonomy method based on diversity comparisons of distances among amino acids in protein sequences
  • Species taxonomy method based on diversity comparisons of distances among amino acids in protein sequences
  • Species taxonomy method based on diversity comparisons of distances among amino acids in protein sequences

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0021] Such as figure 1 As shown, the amino acid distance polymorphism comparison protein sequence of the present invention carries out the method for species classification, comprising the following steps:

[0022] S10: number each amino acid on the protein sequence;

[0023] S20: Calculate the distance between adjacent amino acids of the same type on the protein sequence;

[0024] S30: Count the number of occurrences of different distances between adjacent amin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention brings forward a species taxonomy method based on diversity comparisons of distances among amino acids in protein sequences. The method comprises following steps: S10, numbering each amino acid of a protein sequence; S20, calculating the distance between every two adjacent proteins of the same kind in each protein sequence; S30, counting frequency of different distances between every two adjacent proteins of the same kind in each protein sequence; and S40, comparing statistical data in a pair-wise sequence manner, setting up distance matrixes and performing species taxonomy by calculating phylogeny trees in generating systems based on distance matrixes. The species taxonomy method based on diversity comparisons of distances among amino acids in protein sequences has following beneficial effects: differences among amino acid in sequences are shifted to be those among distances of amino acid so that space is saved and no gap is required; and the method is easy and helps to simplify calculation amount.

Description

technical field [0001] The invention belongs to the field of species identification, in particular to a method for classifying species by comparing protein sequences with amino acid distance polymorphisms. Background technique [0002] At present, according to the theory of evolution, if two protein sequences come from the same ancestor, there will be a certain degree of homology, and the closer the relative relationship, the higher the homology of the species, so it can be classified according to the sequence of amino acids in the protein sequence, and establish The phylogenetic tree of molecular evolution. At present, the Clustal algorithm proposed by Higgins and Sharp in 1988 is widely used. Firstly, multiple sequences are pairwise compared to construct a distance matrix to reflect the pairwise relationship between sequences, and then a phylogenetic tree is generated according to the distance matrix calculation. When two sequences are compared, the simplest case is to ig...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/24G06F19/22
CPCG16B20/00G16B30/00
Inventor 孔登陈永王晓红
Owner WEIFANG MEDICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products