Fast and safe retrieval method, device and storage medium for DNA sequences
A sequence indexing and sequence retrieval technology, applied to the fast and safe retrieval of DNA sequences and to related devices and storage media. It is stated to address the problem of life insurance, long-term care insurance, etc. not being covered.
Embodiment Construction
[0018] Disclosed herein is a method for indexing DNA sequences (or, more generally, genomic sequences, e.g., DNA sequences, RNA sequences, etc.) using finite memory tree source models such as Markov models (e.g., fixed-order or variable-order), context tree weighting (CTW) models (the illustrative approach used herein), etc. An index record for the DNA sequence is then constructed, including the model and its parameters. The codeword length estimated by applying the same finite memory tree model to a query DNA sequence then serves as the basis for a quantitative comparison against the codeword length estimated by directly modeling the query DNA sequence using CTW. This comparison yields a comparative measure that evaluates the similarity of the query and indexed DNA sequences. For example, the codeword length comparison is computed using a mutual information measure such as entropy or information gain (IG), or by similar means.
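The comparison described in paragraph [0018] can be illustrated with a minimal Python sketch. It is not the disclosed implementation: for brevity it substitutes a fixed-order Markov model with a Krichevsky-Trofimov (KT) estimator for CTW, and it uses a per-symbol codeword length difference as the comparative measure rather than a specific entropy or information gain formula. The function names (`build_index`, `codelength_under_index`, `codelength_direct`, `dissimilarity`), the `order` parameter, and the restriction to the alphabet `ACGT` are all assumptions made for illustration.

```python
import math
from collections import defaultdict

ALPHABET = "ACGT"  # assumption: sequences contain only these four symbols


def build_index(seq, order=2):
    """Build a hypothetical index record: model order plus per-context counts."""
    counts = defaultdict(lambda: dict.fromkeys(ALPHABET, 0))
    for i in range(order, len(seq)):
        counts[seq[i - order:i]][seq[i]] += 1
    return {"order": order, "counts": {k: dict(v) for k, v in counts.items()}}


def _kt_prob(ctx_counts, symbol):
    """KT estimate of P(symbol | context) from the counts seen in that context."""
    total = sum(ctx_counts.values())
    return (ctx_counts[symbol] + 0.5) / (total + 0.5 * len(ALPHABET))


def codelength_under_index(query, index):
    """Estimated bits to encode the query with the frozen indexed model."""
    order = index["order"]
    empty = dict.fromkeys(ALPHABET, 0)  # unseen context: uniform KT estimate
    bits = 0.0
    for i in range(order, len(query)):
        ctx_counts = index["counts"].get(query[i - order:i], empty)
        bits -= math.log2(_kt_prob(ctx_counts, query[i]))
    return bits


def codelength_direct(query, order=2):
    """Estimated bits to encode the query with a model learned from the query itself."""
    counts = defaultdict(lambda: dict.fromkeys(ALPHABET, 0))
    bits = 0.0
    for i in range(order, len(query)):
        ctx = query[i - order:i]
        bits -= math.log2(_kt_prob(counts[ctx], query[i]))
        counts[ctx][query[i]] += 1  # update only after coding the symbol
    return bits


def dissimilarity(query, index):
    """Per-symbol excess codeword length; lower values suggest a closer match."""
    n = max(len(query) - index["order"], 1)
    excess = codelength_under_index(query, index) - codelength_direct(query, index["order"])
    return excess / n


if __name__ == "__main__":
    index = build_index("ACGT" * 50, order=2)
    for q in ("ACGT" * 10, "AACCGGTT" * 5):
        print(q[:12], round(dissimilarity(q, index), 3))
```

In this sketch the index record holds only context counts and the model order, so a query is scored against the record without the raw indexed sequence being read at query time. A query whose statistics match the indexed sequence costs few bits under the indexed model and so receives a lower score than a query with different statistics.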
[0019] This method preserves the privacy of patients whose DNA seq...


