Constructing method of genetic marker reference system for group differentiating and identification, and genetic marker reference system
A technology of genetic markers and construction methods, used in special data processing applications, instruments, electrical digital data processing, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0089] This embodiment is used to illustrate how to use the method of the present invention to construct a reference set containing 16 SNPs from 55786541 SNPs for the distinction of Africans, Europeans and Asians ( figure 2 ,Table 1).
[0090] Specific steps are as follows:
[0091] 1. Data segmentation
[0092] Based on the 55,786,541 SNPs of 108 Africans, 313 Europeans, and 993 Asians in the 1000 Genomes Project (1000 Genomes Project), the data was segmented according to the intercontinental source of the population, and two types were obtained after segmentation. The first category is {Africa, (Europe, Asia)} and the second category is {Europe, Asia}.
[0093] 2. Data filtering
[0094] Calculate the F of the SNPs in each class ST value, and accordingly sort the SNPs in each class in descending order, and keep the top 20,000 SNPs.
[0095] 3. SNP selection
[0096] A feature selection algorithm was used to select a subset of 100 SNPs in each class after data filterin...
Embodiment 2
[0104] In this embodiment, the SNP reference frame selected from the data set of 178 SNPs is used by the method (AIM-SNPtag) described in the present invention. These 178 SNPs have been identified in "Li C-X, Pakstis AJ, Jiang L, Wei Y-L, Sun Q-F, Wu H, BulbulO, Wang P, Kang L-L, Kidd JR, Kidd KK. A panel of 74AISNPs: Improved ancestryinference within Eastern Asia. Forensic Science International: Genetics 23(2016) 101-110." Publicly reported in the article.
[0105] This embodiment is used to illustrate how to not go through step (2)---data filtering, and directly use step (1), (3) and (4)---data segmentation, SNP selection and integration optimization, from a smaller number of SNPs Concentrate selection to construct SNP reference system.
[0106] Specific steps are as follows:
[0107] 1. Data segmentation
[0108] Based on the Africans (AFR), Europeans (EUR), South Asians (SA), East Asians (EA) and Southeast Asians (SEA) in the Thousand Genomes Project (1000Genomes Projec...
Embodiment 3
[0121] This example is used to illustrate how to use the method of the present invention to select a reference set containing 47 STRs from 670,646 STR loci for distinguishing Africans, Europeans and Asians. This embodiment only involves steps (1) to (3) of the method of the present invention, and does not involve step (4).
[0122] Specific steps are as follows:
[0123] 1. Data segmentation
[0124] Based on the 670,646 STRs of 108 Africans, 313 Europeans, and 993 Asians in the 1000 Genomes Project (1000 Genomes Project), the data is segmented according to the intercontinental source of the population, and the segmentation categories are {Africa, Europe, Asia}.
[0125] 2. Data filtering
[0126] Firstly, STR sites with more than 10% missing data were filtered out; a total of 90,537 STR sites passed this filtering criterion. Then, calculate the F that preserves the STR ST value, and accordingly sort the STRs in each class in descending order, and keep the first 20,000 ST...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com