Unlock instant, AI-driven research and patent intelligence for your innovation.

A data set and method for high throughput sequencing somatic mutation detection performance evaluation

A technology for mutation detection and production methods, applied in the fields of instrumentation, genomics, proteomics, etc., can solve the problems of unsatisfactory performance of somatic mutation detection, and achieve the effect of improving authenticity and accuracy and promoting development.

Pending Publication Date: 2020-11-10
SOUTH CHINA UNIV OF TECH
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current detection performance of somatic mutations based on high-throughput sequencing is not satisfactory. For the same sample, there are large differences in the detection reports given by many commercial organizations. One of the main reasons is the lack of suitable data sets. Evaluation of the Accuracy of Somatic Mutation Detection

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data set and method for high throughput sequencing somatic mutation detection performance evaluation
  • A data set and method for high throughput sequencing somatic mutation detection performance evaluation
  • A data set and method for high throughput sequencing somatic mutation detection performance evaluation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0047] A dataset and method for performance evaluation of somatic mutation detection in high-throughput sequencing, comprising the following steps:

[0048] (1) Sample preparation and data preprocessing

[0049] Prepare 50 μg of standard DNA samples of NA12878 and Yanhuang No. 1 (YH-1) respectively, use Invitrogen Qubitfluorometer 3.0 to detect the concentration, and then use 1% agarose gel electrophoresis to purify. After the sample was purified, 1 μg of NA12878 and YH-1 genomic DNA was fragmented into DNA fragments of 100-500 bp in length using a CovarisE220 ultrasonic breaker, and exons were captured using the Agilent Human All Exon V5 Target Enrichment Bait Kit. Illumina adapters were then ligated to both ends of the DNA fragments using the Agilent SureSelectXT kit. After the adapter ligation was completed, the samples were amplified by PCR to obtain the Illumina exome sequencing library.

[0050] After the library was constructed, it was sent to the Illumina NovaSeq seque...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data set and method for evaluating the somatic mutation detection performance through high-throughput sequencing. A human reference genome hg19 mutation set, chromosomes, positions and base mutation information of the human reference genome hg19 mutation set are shown in a table 2. The application of the mutation set in evaluating the performance of the somatic mutation detection method comprises the following steps: sampling a BAM file to different sequencing depths, and combining a DNA sample to be detected with a positive control BAM file, so that different proportions of the DNA sample to be detected represent different mutation frequencies; carrying out mutation detection on the mixed data by using software to obtain a detection result, and comparing the detection result with a human reference genome hg19 mutation set; the higher the precision rate, recall rate and F value scores are, the higher the accuracy and authenticity of the method are. According to the method, the mutation true set is used for evaluating the performance of the mutation detection method, the sequencing depth and mutation frequency which are more suitable for being adopted are obtained, and the authenticity and accuracy of mutation detection results are effectively improved.

Description

technical field [0001] The invention belongs to the field of biotechnology, and in particular relates to a data set and a method for performance evaluation of high-throughput sequencing somatic mutation detection. Background technique [0002] Cancer is one of the major diseases threatening human health. Due to the heterogeneity of cancer, the efficacy of the same therapy can vary widely among different individuals with the same cancer type. Therefore, there is currently no single therapy that is universally applicable to all cancers, and more precise therapies are urgently needed to be developed. [0003] In recent years, great progress has been made in the treatment of cancer, especially targeted therapy. Most current targeted therapies are based on the detection of genetic mutations, for example, the tyrosine kinase inhibitors afatinib and erlotinib are used to target the epidermal growth factor receptor (EGFR) in non-small cell lung cancer ( EGFR) mutations; the serin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B20/50
CPCG16B20/50
Inventor 杜红丽陈子曦
Owner SOUTH CHINA UNIV OF TECH