Use method of haplotype ancestral database

A haplotype and database technology, applied in the field of bioinformatics, can solve the problems of long time for ancestral analysis and incomplete classification of ancestral information in the database, so as to reduce the recognition time, speed up, and avoid labeling errors.

Active Publication Date: 2022-07-22
广州鸿溪见杉科技有限公司
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a method for using the haplotype ancestry database to solve the problems of incomplete classification of ancestry information in the database and too long time for ancestry analysis in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0024] The method of using the haplotype ancestry database, the database includes the Hapmap project, the International Thousand Genomes Project, Qiyun Nord, GWAS, of course, those skilled in the art can select other databases containing ancestral source data as needed, and the use steps include:

[0025] (1) Set an extraction box capable of extracting 10 consecutive SNPs, an extraction box capable of extracting 11 consecutive SNPs, an extraction box capable of extracting 12 consecutive SNPs... and an extraction box capable of extracting 200 consecutive SNPs, The sample haplotype SNP information of each ethnic group in the database is extracted respectively, and each sample haplotype is extracted by the above extraction frame from its 5' end to the 3' end by moving SNP sites one by one located in the extraction frame. continuous SNP site information, the above extraction boxes are moved simultaneously or in batches, and the information of several consecutive SNP sites extracted...

Embodiment 2

[0030] The difference from the first embodiment is that the identification frame only adopts the 5' end identification frame.

Embodiment 3

[0032] The difference from the first embodiment is that the identification frame only adopts the identification frame at the 3' end.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a use method of a haplotype ancestral database. The use method comprises the following steps of: moving an identification frame to identify information of n continuous SNP (Single Nucleotide Polymorphism) sites falling into the identification frame; matching the information of n continuous SNP loci in the recognition frame with the comparison units, and if the comparison units with the same information of the SNP loci cannot be matched, determining that the SNP loci are unknown; if a comparison unit with the same information of the SNP sites can be matched, recording corresponding race information of the comparison unit, calling an identification frame capable of identifying n + 1 continuous SNP sites, and identifying the n + 1 continuous SNP sites containing the n SNP sites which are the same as the comparison unit; and repeating the previous step until the information of a plurality of continuous SNP sites in the identification frame cannot be matched with the comparison unit with the same SNP site information. The method has the advantages that the ancestor tracing speed is high, the error is small, the definition is high, the big data capacity is large, and a basis is provided for finding out disease and gene association, precise medical treatment, molecular detection, medication guidance and other technical researches in the medical field.

Description

technical field [0001] The invention relates to the technical field of bioinformatics, in particular to the technology of tracing the ancestral origin of genes. Background technique [0002] SNPs are important genetic markers and are widely used in the field of bioinformatics. Mining the data hidden in SNPs has important guiding significance for the analysis of the treatment of genetic diseases, the analysis of the relationship between the severity of disease progression and genes, and the study of the difference between drugs on different populations and other related work in medical technology. [0003] Although scientists in the biological sciences have collected a large number of human genome data, and there are now multiple genome databases, human migration has occurred many times, and genetic recombination has led to the common phenomenon of mixing of different populations. However, the genes included in the existing databases The recorded ethnic information is not co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B50/00G16B20/30G16B20/20
CPCG16B50/00G16B20/20G16B20/30Y02D10/00
Inventor 宋清马丽
Owner 广州鸿溪见杉科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products