Method for calculating error and error rate of gene mutation detection analysis process

A technique for calculating error and error rate, applied in the field of gene mutation detection and analysis.

Inactive Publication Date: 2018-07-06
杭州米天基因科技有限公司 +1

AI Technical Summary

Problems solved by technology

Direct Mendelian inconsistencies between parents and children are mostly false positives caused by sequencing errors, so detecting them can greatly help improve the accuracy of mutation detection methods. According to scientific research, reproductive inheritance produces about one mutation per 10 million to 30 million nucleotides, whereas the current genetic testing and analysis process produces one or several detection errors per 100 nucleotides.


Examples

Detailed description of the embodiments

[0030] In order to illustrate the present invention more clearly, the present invention will be further described below in conjunction with preferred embodiments and accompanying drawings. Those skilled in the art should understand that the content specifically described below is illustrative rather than restrictive, and should not limit the protection scope of the present invention.

[0031] The invention discloses a method that uses Mendel's laws of inheritance to analyze gene site mutation data files, and thereby calculates the error and error rate of the gene mutation detection and analysis process. The specific scheme of the present invention is as follows:

[0032] The first step is to preprocess the raw data of the gene to be analyzed with several gene mutation analysis software tools to generate gene sample data, specifically:

[0033] Align the raw data of the gene to be analyzed against the reference genome to determine the position of each short sequence on the genome...

Abstract

The invention discloses a method for calculating the error and error rate of a gene mutation detection and analysis process. The method comprises the steps of preprocessing the raw gene data of family members to obtain gene sample data; analyzing the gene sample data with gene mutation detection software to obtain a gene site mutation data file in VCF format; and scanning the mutation data file to check whether the mutation site information in the child's mutation file is consistent with the Mendelian law of inheritance. Because the actual reproductive genetic mutation rate is extremely low, sites that violate the Mendelian law of inheritance are mainly caused by errors in the analysis process. The number and the proportion of such sites are therefore taken as the error and the error rate of the gene mutation detection and analysis process, and they provide a basis for subsequent algorithm optimization and software design.
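The scan described in the abstract can be sketched in a few lines of Python. This is a minimal illustration rather than the patent's implementation: it assumes a single multi-sample VCF holding one child and both parents, diploid autosomal sites only, and the hypothetical sample names `CHILD`, `FATHER` and `MOTHER`. The function names are likewise invented for this example. A site counts as a Mendelian error when the child's genotype cannot be formed from one paternal and one maternal allele.

```python
from itertools import product


def parse_gt(sample_field, gt_index):
    """Return the genotype as a tuple of allele indices, or None if missing."""
    gt = sample_field.split(":")[gt_index]
    alleles = gt.replace("|", "/").split("/")
    if len(alleles) != 2 or "." in alleles:
        return None  # skip missing or non-diploid calls
    return tuple(int(a) for a in alleles)


def is_mendelian_consistent(child, father, mother):
    """True if the child can take one allele from each parent."""
    possible = {tuple(sorted(pair)) for pair in product(father, mother)}
    return tuple(sorted(child)) in possible


def mendelian_error_rate(vcf_lines, child, father, mother):
    """Count sites violating Mendelian inheritance in a trio VCF.

    Returns (errors, checked, rate). Sample names are assumptions
    supplied by the caller; sites with missing genotypes are skipped.
    """
    columns = None
    checked = errors = 0
    for line in vcf_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # blank line or meta-information line
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            columns = {name: i for i, name in enumerate(fields)}
            continue
        if columns is None:
            raise ValueError("data line found before the #CHROM header")
        fmt = fields[8].split(":")  # FORMAT is the ninth VCF column
        if "GT" not in fmt:
            continue
        gt_index = fmt.index("GT")
        gts = [parse_gt(fields[columns[s]], gt_index)
               for s in (child, father, mother)]
        if any(g is None for g in gts):
            continue
        checked += 1
        if not is_mendelian_consistent(*gts):
            errors += 1
    rate = errors / checked if checked else 0.0
    return errors, checked, rate


if __name__ == "__main__":
    header = "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tCHILD\tFATHER\tMOTHER"
    demo = [
        "##fileformat=VCFv4.2",
        header,
        "1\t100\t.\tA\tG\t50\tPASS\t.\tGT\t0/1\t0/0\t0/1",
        "1\t200\t.\tC\tT\t50\tPASS\t.\tGT\t1/1\t0/0\t0/1",
    ]
    print(mendelian_error_rate(demo, "CHILD", "FATHER", "MOTHER"))
```

Treating every inconsistent site as a pipeline error follows the abstract's argument that true de novo mutations are too rare to matter at this scale. A real pipeline would also need to handle sex chromosomes and low-quality calls, which this sketch ignores.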

Description

Technical field

[0001] The invention relates to the field of biological gene mutation analysis, in particular to a method for calculating the error and error rate of the gene mutation detection and analysis process.

Background

[0002] Genome sequencing is the process of using high-throughput sequencing technology to sequence the human genome rapidly and comparing the result with the standard human genome to obtain the genome sequence. In recent years, through large-scale epidemiological surveys, scientists have discovered that a large number of gene variations are closely related to human diseases. However, analyzing and processing such a huge volume of sequencing data is currently the most difficult problem. As a result, a large number of genetic data analysis software packages have emerged.

[0003] The genome data analysis process mainly includes quality control, sequence alignment, mutation detection, and mutation annotation. At present, the most commonly used bioinformatics analysis softwa...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F19/18
CPC: G16B20/00
Inventor: 杨兴礼, 付永全, 刘小军, 尹潼, 陶涛
Owner: 杭州米天基因科技有限公司