Unlock instant, AI-driven research and patent intelligence for your innovation.

Method, device, storage medium and processor for processing high-throughput sequencing data

A sequencing data, high-throughput technology, applied in instrumentation, sequence analysis, proteomics, etc., can solve the problem of inaccurate sequencing data processing results, and achieve the effect of improving uniformity

Inactive Publication Date: 2020-08-07
LIAONING KEJUN BIOLOGICAL
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The main purpose of the present invention is to provide a method, device, storage medium and processor for processing high-throughput sequencing data, so as to solve the problem of inaccurate processing results of sequencing data in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device, storage medium and processor for processing high-throughput sequencing data
  • Method, device, storage medium and processor for processing high-throughput sequencing data
  • Method, device, storage medium and processor for processing high-throughput sequencing data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0096] The above four Cosmic DNAs are prepared into a DNA library containing the above mutation sites, and then high-throughput sequencing is performed to obtain sequencing data. After quality control of the sequencing data, post-quality control reads are obtained. Three mutation frequency libraries were constructed for each site, and each mutation frequency library was replicated three times for a total of nine detections.

[0097] By using the target region amplification primer 1 and target region amplification primer 2 of the present application to screen the reads after quality control, the reads that do not completely cover the target region are excluded, and the reads that completely cover the target region are obtained. The specific results are shown in Table 2 below. It can be seen from Table 2 that based on the high-throughput and high-quality data of next-generation sequencing, the number and ratio of reads retained after screening still have sufficient data to ensure...

Embodiment 2

[0103]dPCR (digital PCR) is considered to be the closest detection method to the real mutation at present. Through the library construction of the four mutation sites in Table 1, 3 mutation frequency libraries were constructed for each site, and each mutation frequency library was repeated 3 times, and a total of 9 detections were performed. Then, the dPCR analysis method, the existing (conventional) second-generation sequencing data processing method and the method of the present application are respectively used to detect the detection site, and the detection results are shown in Table 3 below.

[0104] table 3:

[0105]

[0106]

[0107] Note: "-" indicates that the sample was not sequenced

[0108] As can be seen from the results in Table 3 above, for the normal locus of COSM6213, the detection results of the present application are similar to the results of the next-generation sequencing data processing method and the dPCR analysis method.

[0109] For Deletion at...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a high-throughput sequencing data processing method and device, a storage medium and a processor. The high-throughput sequencing data processing method comprises the following steps of: screening high-throughput sequencing data by utilizing a target area amplification primer so as to obtain reads which completely covers a target area; and comparing the reads which completelycovers the target area with a reference genome so as to obtain a comparison result. According to the method, sequencing data which completely covers the target area is screened through the target area amplification primer, and by utilizing the sequencing data, coverage depths of both a 5' end and a 3' end of the target area are greatly improved, so that the problem that the detection result of the 3' end is incorrect as the coverage depth of the 3' end is lower than the coverage depth of the 5' end is solved.

Description

technical field [0001] The present invention relates to the field of sequencing data processing, in particular, to a method, device, storage medium and processor for processing high-throughput sequencing data. Background technique [0002] Next-generation sequencing gene mutation detection is a method of using a high-throughput sequencer to generate a large amount of DNA read (read length) sequence data, covering the same locus multiple times, and calculating the locus mutation frequency by the ratio of the number of mutated reads and unmutated reads. . [0003] In the current bioinformatics analysis methods for gene mutations, after the sequencing data is offline, the data is subjected to quality control (quality control). The quality control operation generally includes removing short reads (≤25bp reads), trimming the 3' end for sequencing Poor quality bases are then analyzed using all data sequencing data. When using the above methods for statistical analysis, there are...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G16B30/10G16B20/20
CPCG16B20/00G16B30/00
Inventor 陶炳忠
Owner LIAONING KEJUN BIOLOGICAL