Whole genome sequencing data calculation interpretation method

A whole-genome sequencing and data computing technology, which is applied in computing, electrical digital data processing, special data processing applications, etc., can solve the problems of limited GPU hardware structure characteristics, high energy consumption, large area, etc., to achieve rich diversity, Speed ​​up processing and improve real-time effects

Inactive Publication Date: 2017-09-22
GENETALKS BIO TECH CHANGSHA CO LTD
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantage is: to achieve the same function, FPGA is generally slower than ASIC (Application Specific Integrated Circuit, ASIC), and the circuit area is larger than ASIC
However, there are two problems with the current GPU: First, due to the hardware structural characteristics of the GPU, many parallel algorithms cannot be effectively executed on the GPU; second, the GPU will generate a lot of heat during operation, and the energy consumption is high
To sum up, the above four computing devices have their own characteristics and limitations.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Whole genome sequencing data calculation interpretation method
  • Whole genome sequencing data calculation interpretation method
  • Whole genome sequencing data calculation interpretation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] like figure 1 As shown, the implementation steps of the calculation and interpretation method for whole genome sequencing data in this embodiment include:

[0055] 1) Input the reference whole genome data and original sequencing sample data for whole genome sequencing; preprocess the original sequencing sample data based on CPU to obtain reliable data of sequencing samples; The genome data is preprocessed to obtain the reference whole genome data with index as the comparison object, and the CPU calls the index generator implemented by FPGA hardware to accelerate the generated index; if the reference whole genome data does not need preprocessing, directly The original indexed reference whole genome data is used as the comparison object; this step requires the use of two processors, CPU and FPGA;

[0056] 2) The CPU calls the FPGA to accelerate the alignment between the reliable data of the sequencing sample and the reference whole genome data with an index (alignment), ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a whole genome sequencing data calculation interpretation method, which comprises the following implementation steps that: inputting reference whole genome data used for whole genome sequencing and organic sequencing sample data, and carrying out preprocessing; calling an FPGA (Field Programmable Gate Array) by a CPU (Central Processing Unit) to be accelerated to compare sequencing sample reliable data with the reference whole genome data with an index to obtain a comparison result with a repeated sign and the index; calling the FPGA and a GPU (Graphics Processing Unit) by the CPU to be accelerated to carry out genome reassembling on the sequencing sample reliable data, and carrying out variant identification recognition on the comparison result with the repeated label and the index; and calling the GPU and a DSP (Digital Signal Processor) by the CPU to be accelerated to carry out visual processing, and calling a deep learning model realized by hardware on the FPGA by the CPU to carry out the analysis and the mining of the whole genome and a variant function on the basis of a visual processing result. By use of the method, the GPU, DSP and FPGA processors can be comprehensively utilized for acceleration, and the method has the advantages of being quick, real-time, accurate in penetration, popular and easy in understanding and varied in forms.

Description

technical field [0001] The invention relates to gene sequencing technology, in particular to a calculation and interpretation method for whole genome sequencing data. Background technique [0002] In recent years, with the widespread application of Next Generation Sequence (NGS) technology, the cost of gene sequencing has dropped rapidly, and gene sequencing technology has been promoted in a wider range of fields such as biology, medicine, health, criminal investigation, agriculture, etc. application. Among them, NGS-based Whole Genome Sequencing (WGS) is a very valuable branch field that has received extensive attention. [0003] Whole genome sequencing refers to the sequencing of all the genes in the genome of a biological individual, that is, the determination of the base sequence of its deoxyribonucleic acid (Deoxyribonucleic Acid, DNA). Genomic information has been used to identify genetic diseases, find mutations that drive cancer development, and track disease outbr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/20G06F19/22G06F19/24
CPCG16B30/00G16B25/00G16B40/00
Inventor 宋卓刘蓬侠李根
Owner GENETALKS BIO TECH CHANGSHA CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products