Analysis method and device for detecting point mutation based on third-generation sequencing data

A technology of sequencing data and analysis methods, applied in the field of bioinformatics analysis, can solve the problems of low sensitivity of low-frequency mutation detection, insufficient application scenarios, and inability to detect samples and sites at the same time

Active Publication Date: 2022-02-01
QITAN TECH LTD CHENGDU
View PDF12 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The PCR method has the characteristics of high sensitivity, and the technology is mature, but each pair of primers can only detect one mutation, cannot detect too many samples and sites at the same time, and the throughput is low
The cost of Sanger sequencing is low, but the amount of sample required is large, and the detection sensitivity for low-frequency mutations is low
Next-generation sequencing has the characteristics of high throughput, and the cost of sequencing is also decreasing year by year. However, the current methods and tools commonly used to detect point mutations are not high in specificity (such as Varscan) and low in sensitivity to low-frequency detection (such as Mutect). Or the use of partial assembly steps leads to too long running time (such as Mutect2), which cannot well meet the needs of point mutation detection
Although there are some methods for detecting point mutations developed based on three-generation sequencing data at this stage (as mentioned above), their respective shortcomings are also very obvious. The most important ones are limited by the quality of sequencing and the comparison algorithm or deep learning training set they rely on. The data distribution, etc., and the applicable scenarios are not wide enough, and the robustness (robust) is not enough

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Analysis method and device for detecting point mutation based on third-generation sequencing data
  • Analysis method and device for detecting point mutation based on third-generation sequencing data
  • Analysis method and device for detecting point mutation based on third-generation sequencing data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0092] Embodiment 1 uses the method analysis data of the present invention

[0093] 1. will contain BRAF-V600E, EGFR-L858R, EGFR-T790M, KRAS-G13D as well as AKT1-E17K The standard samples of the standard sample and the negative control sample NA12878 were prepared through the experimental library and repeated three times, and sequenced using the QNome-9604 nanopore sequencer to obtain 6 original long-read sequencing data, among which HUM964, HUM965 and HUM966 It is positive control data, and HUM967, HUM968 and HUM969 are negative control data.

[0094] 2. Extract 9 short sequences with a fixed length of 101 bp on the genome for the 5 target sites to be detected in step 1 according to their positions, in which the positions of the target sites on the extracted short sequences are respectively fixed at 11th base, 21st base, 31st base, 41st base, 51st base, 61st base, 71st base, 81st base and 91st base Base (ie D=10bp), the final set of 9 short sequence fragments containin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an analysis method and device for detecting point mutation based on third-generation sequencing data. The analysis method comprises the following steps: 1) extracting a first sequence subset containing to-be-detected point mutation; 2) extracting a seed sequence from the first sequence subset to obtain a second sequence subset; 3) obtaining an original data set with expected quality; 4) using the seed sequence pair of the second sequence subset to obtain N data sets containing target sequences; 5) carrying out point mutation detection analysis on the N data sets containing the target sequence; 6) allocating a weight W to the result of each point mutation in the N detection results; and 7) calculating a point mutation result and frequency thereof according to a formula. The invention further provides a device for detecting point mutation based on third-generation sequencing data. By using the method provided by the invention, the problem of false negative caused by low comparison rate due to random indel or relatively high sequencing error is effectively avoided from the aspect of data features, and meanwhile, the false positive result can be more effectively controlled.

Description

technical field [0001] The invention belongs to the field of sequencing technology and biological information technology analysis of sequencing data, and in particular relates to a method for detecting point mutations based on third-generation sequencing data, and also relates to a device and system for detecting point mutations based on third-generation sequencing data. Background technique [0002] A point mutation is a change in only one base pair. General point mutations can be base substitutions, single base insertions or base deletions; narrow sense point mutations are also called single base substitutions. Base substitutions are further divided into two types: transitions and transversions. Common methods for detecting gene point mutations include PCR, Sanger sequencing (first-generation sequencing) and next-generation sequencing. The PCR method has the characteristics of high sensitivity, and the technology is mature, but each pair of primers can only detect one mu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B30/10G16B20/50
CPCG16B30/10G16B20/50
Inventor 郎继东孙继国
Owner QITAN TECH LTD CHENGDU
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products