A method, system, equipment and medium for establishing a gene comparison table

A gene comparison and reference genome technology, applied in instruments, sequence analysis, database indexing, etc. It addresses problems such as slow table lookup, increased memory pressure, and missing table entries, so as to reduce comparison calculations, improve accuracy, and improve operating efficiency.

Active Publication Date: 2022-05-17
INSPUR SUZHOU INTELLIGENT TECH CO LTD
Cites: 4; Cited by: 0

AI Technical Summary

Problems solved by technology

In particular, if multiple extension tables are created for different seed lengths, memory pressure grows exponentially.
Subsequent table lookups also become very slow.
If reference seeds are simply read at a fixed interval, many entries are skipped, so table information is missing and the final accuracy suffers.

Examples


Embodiment Construction

[0020] In order to make the object, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0021] It should be noted that the expressions "first" and "second" in the embodiments of the present invention are used only to distinguish two entities, or two parameters, that share the same name but are not the same. They are used for convenience of expression and should not be construed as limiting the embodiments of the present invention; this will not be repeated in the subsequent embodiments.

[0022] Based on the above purpose, the first aspect of the embodiments of the present invention proposes an embodiment of a method for establishing a gene comparison table. Figure 1 is a schematic diagram of an embodiment of the method for est...

Abstract

The invention discloses a method, system, device and storage medium for establishing a gene comparison table. The method includes the following steps: reading a contiguous subsequence of a first length from a reference genome sequence as a seed, and determining the longest read length; storing multiple seeds in a buffer in turn, using the first seed as the candidate seed, and judging whether the number of seeds in the buffer reaches a threshold; in response to the number of seeds in the buffer not reaching the threshold, judging whether the longest read length corresponding to the current seed stored in the buffer is greater than the sum of the first length and the threshold; in response to the longest read length corresponding to the current seed being greater than the sum of the first length and the threshold, judging whether the hash value of the current seed is less than the hash value of the candidate seed; and in response to the hash value of the current seed being less than the hash value of the candidate seed, writing the current seed into the gene comparison table and updating the candidate seed to the current seed.
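The steps in the abstract can be read as a candidate-selection loop over seeds. The following Python sketch is only an illustration under stated assumptions, not the patented implementation: the function name `build_comparison_table`, the `longest_read_len` callback (the abstract does not say how the longest read length is determined), the CRC32 default hash, and the behavior once the buffer reaches the threshold (here the window is simply reset) are all hypothetical.

```python
import zlib
from typing import Callable, Dict, List


def crc32_hash(seed: str) -> int:
    """Deterministic stand-in hash; the source does not name a hash function."""
    return zlib.crc32(seed.encode("ascii"))


def build_comparison_table(
    reference: str,
    seed_len: int,                            # the "first length" of each seed
    threshold: int,                           # buffer size threshold
    longest_read_len: Callable[[int], int],   # assumed: position -> longest read length
    hash_fn: Callable[[str], int] = crc32_hash,
) -> Dict[str, List[int]]:
    table: Dict[str, List[int]] = {}
    buffer: List[str] = []
    candidate = None

    for pos in range(len(reference) - seed_len + 1):
        seed = reference[pos:pos + seed_len]
        buffer.append(seed)

        # The first seed in the buffer becomes the candidate seed.
        if candidate is None:
            candidate = seed
            continue

        # Assumption: the abstract does not describe the "threshold reached"
        # branch, so this sketch just starts a fresh window.
        if len(buffer) >= threshold:
            buffer.clear()
            candidate = None
            continue

        # Buffer below threshold: check the read-length condition first,
        # then compare hash values against the candidate.
        if longest_read_len(pos) > seed_len + threshold:
            if hash_fn(seed) < hash_fn(candidate):
                table.setdefault(seed, []).append(pos)
                candidate = seed

    return table
```

Because a seed is written only when its hash is lower than the current candidate's, far fewer entries are stored than in a table of every seed, which is consistent with the stated goal of reducing memory pressure.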

Description

Technical field

[0001] The present invention relates to the field of gene comparison, and more specifically to a method, system, computer device and readable medium for establishing a gene comparison table.

Background

[0002] Most whole-genome comparison programs used in gene sequencing follow the typical seed-and-extend alignment approach. To map DNA subsequences to the reference genome quickly and accurately, the general process is to take the reference genome, split it into multiple seeds using K-mers or another algorithm, and compile the seeds into a hash table. Each sequence to be compared is then divided in the same way, and its corresponding position is retrieved by looking it up in the table.

[0003] Mainstream gene comparison tools now often use full-text indexes, such as suffix arrays or FM indexes. The advantage of this approach is that seeds of any length can be used, which helps to increase the uniqueness of the seeds and reduce unsuccessful ext...
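The hash-table process described in the background (split the reference into K-mer seeds, store them in a table, then look up the seeds of each query sequence) can be sketched as follows. This is a minimal illustration of the general technique, not code from the patent; the function names `build_kmer_index` and `lookup_read` are hypothetical, and a Python dictionary stands in for the hash table.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def build_kmer_index(reference: str, k: int) -> Dict[str, List[int]]:
    """Map every length-k seed of the reference to its start positions."""
    index: Dict[str, List[int]] = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        index[reference[pos:pos + k]].append(pos)
    return index


def lookup_read(read: str, index: Dict[str, List[int]], k: int) -> List[Tuple[int, int]]:
    """Return (offset in read, position in reference) for every seed hit."""
    hits: List[Tuple[int, int]] = []
    for offset in range(len(read) - k + 1):
        seed = read[offset:offset + k]
        # .get avoids creating empty entries for seeds absent from the index
        for ref_pos in index.get(seed, []):
            hits.append((offset, ref_pos))
    return hits
```

Indexing every position, as this sketch does, is the memory-heavy baseline; the sparse selection of seeds described in the abstract is aimed at shrinking this table.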

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G16B30/10; G16B50/00; G06F16/22
CPC: G16B30/10; G16B50/00; G06F16/2255
Inventor: 葛沅, 史宏志, 尹云峰, 崔星辰
Owner: INSPUR SUZHOU INTELLIGENT TECH CO LTD