A method and system for quickly comparing gene data

A gene data comparison technology, applied in the field of gene analysis, that addresses the low comparison efficiency, complex algorithm structure, and long run time of existing gene data analysis, so as to improve throughput, reduce computing resource usage, and improve efficiency.

Active Publication Date: 2018-12-11
ZHENGZHOU YUNHAI INFORMATION TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, this algorithm is designed to save memory, and its structure is relatively complicated. It compares genes mainly in a serial manner, which leads to excessive resource consumption, long run times, and low comparison efficiency, making it the performance bottleneck of the entire genetic data analysis process.


Examples

Embodiment Construction

[0067] The embodiment of the present application provides a method for quickly comparing genetic data. By exploiting the parallelism and pipelining available in the comparison process, the method uses a host in combination with an FPGA. The host is responsible for preprocessing the genes to be compared and the reference genes. The FPGA hardware platform executes the core module of the gene data comparison algorithm; optimizing the parallel pipeline increases the parallelism of the algorithm and thereby its throughput and the efficiency of gene comparison. The obtained seeds are also integrated in a reasonable way, which reduces the computing resources required. Optimizing these aspects together accelerates the gene comparison process.
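The division of labor described in [0067] can be illustrated with a small software analogy. The following is only a sketch under stated assumptions: the function names (`preprocess`, `host_stage`, `compare_stage`, `run_pipeline`), the preprocessing step (case normalization), and the exact-match lookup standing in for the FPGA kernel are all hypothetical and are not taken from the patent. It shows only how a bounded queue lets a preprocessing stage and a comparison stage run concurrently.

```python
import queue
import threading

SENTINEL = None  # marks the end of the read stream


def preprocess(read: str) -> str:
    """Hypothetical host-side step: strip whitespace and normalise case."""
    return read.strip().upper()


def host_stage(reads, out_q):
    """Host stage: preprocess each read and hand it to the next stage."""
    for read in reads:
        out_q.put(preprocess(read))
    out_q.put(SENTINEL)


def compare_stage(reference, in_q, results):
    """Stand-in for the accelerator kernel (assumption: exact-match lookup only)."""
    while True:
        read = in_q.get()
        if read is SENTINEL:
            break
        results.append((read, reference.find(read)))


def run_pipeline(reads, reference, depth=4):
    """Run both stages concurrently, connected by a bounded queue of size `depth`."""
    q = queue.Queue(maxsize=depth)
    results = []
    producer = threading.Thread(target=host_stage, args=(reads, q))
    consumer = threading.Thread(target=compare_stage, args=(reference, q, results))
    producer.start()
    consumer.start()
    producer.join()
    consumer.join()
    return results


if __name__ == "__main__":
    print(run_pipeline([" acgt\n", "ttga"], "GGACGTTTGA"))
```

The bounded queue is the design choice of interest here: the host can prepare the next reads while the comparison stage is still busy, which is the overlap a hardware pipeline aims for.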

[0068] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and the above drawings are used to distinguish similar objects,...

Abstract

The embodiment of the invention discloses a method for quickly comparing gene data, which comprises the following steps: an FPGA reads the genes to be compared and a reference gene from a buffer, where the length of the genes to be compared is L; the FPGA determines a plurality of target seeds from the reference gene according to the gene to be compared and a preset algorithm, where a target seed is a gene sequence that matches the gene to be compared over a certain length; the FPGA selects the seed with the highest similarity from the plurality of target seeds as the best seed; according to the position of the best seed in the reference sequence, the FPGA obtains a gene sequence of preset length, which is greater than or equal to the length of the best seed; the FPGA scores the best seed by calculating the shortest edit distance between the best seed and the estimated sequence, and the score indicates the accuracy and authenticity of the best seed; the FPGA outputs an optimal comparison result according to the score of the best seed. The method can improve the throughput of the algorithm and the efficiency of gene comparison.
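The steps in the abstract can be sketched in software as follows. This is not the patented FPGA implementation, and several choices are assumptions made for illustration: seeds are found with a k-mer index and extended to maximal exact matches, "highest similarity" is taken to mean the longest exact match, the extracted reference window has length L (the read length), and the edit distance is computed between the whole read and that window. The names `find_seeds`, `align`, and the parameter `k` are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance using a rolling one-row dynamic programming table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # match or substitution
        prev = cur
    return prev[-1]


def find_seeds(read: str, reference: str, k: int = 4):
    """Return sorted (read_pos, ref_pos, length) maximal exact matches of length >= k."""
    index = {}
    for i in range(len(reference) - k + 1):
        index.setdefault(reference[i:i + k], []).append(i)
    seeds = set()
    for i in range(len(read) - k + 1):
        for j in index.get(read[i:i + k], []):
            a, b = i, j
            # Extend the k-mer hit to the left while the bases agree.
            while a > 0 and b > 0 and read[a - 1] == reference[b - 1]:
                a -= 1
                b -= 1
            # Extend to the right from the new start.
            n = 0
            while (a + n < len(read) and b + n < len(reference)
                   and read[a + n] == reference[b + n]):
                n += 1
            seeds.add((a, b, n))
    return sorted(seeds)


def align(read: str, reference: str, k: int = 4):
    """Pick the longest seed, cut a reference window of length L, and score it."""
    seeds = find_seeds(read, reference, k)
    if not seeds:
        return None
    read_pos, ref_pos, length = max(seeds, key=lambda s: s[2])
    start = max(0, ref_pos - read_pos)           # implied start of the read
    window = reference[start:start + len(read)]  # window length >= seed length
    return {"ref_start": start, "seed_len": length,
            "edit_distance": edit_distance(read, window)}


if __name__ == "__main__":
    print(align("CCGTAAGGTTAG", "TTGACCGTAAGCTTAGGC"))
```

A lower edit distance corresponds to a better score for the candidate position; how the patent maps the distance to a final score is not stated in the abstract, so the sketch returns the raw distance.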

Description

Technical field

[0001] The present application relates to the field of gene analysis, in particular to a method and system for quickly comparing genetic data.

Background technique

[0002] The study of genes has a profound impact on human progress, and the comparison of gene data is a basic component and important foundation of bioinformatics. The basic method of gene comparison is to arrange two or more sequences together to show where they are similar. Gaps can be inserted in the sequences so that identical or similar symbols are arranged in the same column. The base pair can be regarded as the basic unit of DNA. A base pair is a pair of bases that match each other and are connected by hydrogen bonds according to certain matching rules. The bases that make up a base pair are A (adenine), T (thymine), C (cytosine), and G (guanine); the matching rules are A-T and G-C.

[0003] Genetic data comparison is one of the longest time-consum...
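The pairing rules A-T and G-C stated in [0002] can be written as a small lookup table. The sketch below is illustrative only; the names `PAIR`, `complement`, and `reverse_complement` are not from the patent.

```python
# Pairing rules from the background section: A pairs with T, G pairs with C.
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq: str) -> str:
    """Return the base-by-base complementary strand."""
    return "".join(PAIR[base] for base in seq)


def reverse_complement(seq: str) -> str:
    """Return the complementary strand read in the opposite direction."""
    return complement(seq)[::-1]


if __name__ == "__main__":
    print(complement("ATGC"))          # TACG
    print(reverse_complement("ATGC"))  # GCAT
```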

Application Information

IPC(8): G06F19/18, G06F19/22
Inventor 史宏志, 赵健, 崔星辰, 尹云峰
Owner ZHENGZHOU YUNHAI INFORMATION TECH CO LTD