Biological information handling

Biological subsequence and biological sequence technology, applied to retrieving and/or correlating biological information. It addresses the problem that existing tools do not allow biological data to be combed through conveniently, and achieves fast and deterministic sequence generation, fewer errors, and improved speed.

Pending Publication Date: 2021-09-28
生物海滩公司

AI Technical Summary

Problems solved by technology

[0006] It is widely accepted that the vast stores of biological data contain many secrets to be discovered, but currently available tools do not allow combing through said data in a sufficiently convenient manner, for example to identify targets for the treatment of a particular pathology.



Examples


Embodiment 1

[0184] Embodiment 1: Correlating biological information according to the present invention

Embodiment 1a

[0185] Example 1a: Finding Biological Sequences with Equivalent Biological Functions

[0186] For an application in the agricultural domain, a proof-of-concept information retrieval was performed. In this proof of concept, the HYFT™ protein fingerprint "WIGLVFL" was identified as a relatively frequently occurring fingerprint within the domain. All protein sequences comprising "WIGLVFL" were then retrieved from the repository of processed biological sequences as described herein, and the results were analyzed. Notably, after studying the biological functions of the retrieved sequences using public databases, it was found that most of them were related to photosynthesis, and that this held across different species. It was thus discovered that the HYFT™ "WIGLVFL" is an anchor associated with distinct but functionally related biological entities.
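The retrieval step in Example 1a can be sketched in a few lines of Python. This is only an illustrative sketch, not the repository implementation described in the application: the record layout, the `find_by_fingerprint` and `function_counts` helpers, and all sequences and annotations below are invented for the example. Only the fingerprint "WIGLVFL" comes from the text.

```python
from collections import Counter

# Hypothetical toy repository: (identifier, species, annotated function, sequence).
# All records are invented for illustration; only "WIGLVFL" comes from the text.
REPOSITORY = [
    ("seq1", "species A", "photosynthesis", "MKTWIGLVFLAAG"),
    ("seq2", "species B", "photosynthesis", "GGWIGLVFLKP"),
    ("seq3", "species C", "transport", "MPLLKWIGLVFL"),
    ("seq4", "species A", "transport", "MKKLLPAAG"),
]


def find_by_fingerprint(repository, fingerprint):
    """Return every record whose sequence contains the fingerprint."""
    return [record for record in repository if fingerprint in record[3]]


def function_counts(records):
    """Count how often each annotated function occurs among the records."""
    return Counter(record[2] for record in records)


hits = find_by_fingerprint(REPOSITORY, "WIGLVFL")
counts = function_counts(hits)
species = {record[1] for record in hits}

print([record[0] for record in hits])  # identifiers of the retrieved sequences
print(counts.most_common(1))           # dominant annotated function among the hits
print(sorted(species))                 # species spanned by the hits
```

A plain substring scan is used here for clarity. A real repository of processed sequences would presumably use an index keyed by fingerprint rather than scanning every sequence.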

Embodiment 1b

[0187] Example 1b: Finding connections between related biological sequences

[0188] As another proof of concept, a simple text search was performed for protein sequences whose names included "fibroblast growth factor receptor 2." The corresponding results were retrieved from the repository of processed biological sequences as described herein. Analysis of the retrieved results showed that essentially all of the protein sequences have "WSLIMES" or "WIKHVEK" as their most stringent HYFT™ (i.e. the longest HYFT™ with the lowest combinatory number). Based on this, the repository of processed biological sequences and/or the repository of fingerprint data strings can be annotated with this information, so that the HYFTs™ "WSLIMES" and/or "WIKHVEK" can be used whenever information is sought about this biological entity, for which they are considered representative.
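A minimal Python sketch of the workflow in Example 1b follows: a name-based text search, selection of the most stringent fingerprint per hit, and annotation of that fingerprint with the entity name. The data structures, the helper names `most_stringent` and `annotate`, the combinatory numbers, and the sequences are all assumptions made for illustration. Ranking by length first and combinatory number second is one reading of "the longest HYFT™ with the lowest combinatory number", not a rule confirmed by the text.

```python
# Hypothetical fingerprint repository: fingerprint -> combinatory number.
# The numbers are invented; only "WSLIMES" and "WIKHVEK" come from the text.
FINGERPRINTS = {"WSLIMES": 1, "WIKHVEK": 1, "LIME": 4, "KHV": 6}

# Hypothetical processed sequences with invented names and residues.
SEQUENCES = [
    {"name": "fibroblast growth factor receptor 2 isoform x", "seq": "AAWSLIMESGG"},
    {"name": "fibroblast growth factor receptor 2 isoform y", "seq": "TTWIKHVEKPP"},
    {"name": "unrelated protein", "seq": "AALIMEGG"},
]


def most_stringent(seq, fingerprints):
    """Pick the longest fingerprint in seq, breaking ties by lowest combinatory number."""
    present = [fp for fp in fingerprints if fp in seq]
    if not present:
        return None
    return max(present, key=lambda fp: (len(fp), -fingerprints[fp]))


def annotate(sequences, fingerprints, query):
    """Map each most-stringent fingerprint of a name-matching sequence to the query."""
    annotations = {}
    for record in sequences:
        if query.lower() in record["name"].lower():
            fp = most_stringent(record["seq"], fingerprints)
            if fp is not None:
                annotations.setdefault(fp, set()).add(query)
    return annotations


annotations = annotate(SEQUENCES, FINGERPRINTS, "fibroblast growth factor receptor 2")
print(annotations)
```

Once such annotations exist, the lookup can run in the opposite direction: starting from a fingerprint, the annotated entity names are available without repeating the text search.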

[0189] Note that different entry and exit points can be linked through HYFTs™. For example, a text sea...


Abstract

In a first aspect, the present invention relates to a computer-implemented method for obtaining information on a biological entity which is based on at least one biological sequence, comprising: (a) providing a repository of fingerprint data strings for a biological sequence database, each fingerprint data string representing a characteristic biological subsequence made up of sequence units, each characteristic biological subsequence having in the biological sequence database a combinatory number which is lower than the total number of different sequence units available thereto, the combinatory number of a biological subsequence being defined as the number of different sequence units that appear in the biological sequence database as a consecutive sequence unit of the biological subsequence; (b) determining one or more fingerprint data strings which are representative for the biological entity; (c) searching a repository comprising information associated with the fingerprint data strings for information associated with the one or more representative fingerprint data strings; and (d) processing the information.
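The combinatory number defined in step (a) can be illustrated with a short Python sketch. It assumes that "a consecutive sequence unit of the biological subsequence" means the unit immediately following an occurrence of the subsequence. That interpretation, the function names, and the four-letter toy alphabet and database are assumptions for illustration, not details taken from the application.

```python
def combinatory_number(database, subsequence):
    """Count the distinct units that directly follow the subsequence in the database."""
    followers = set()
    for seq in database:
        start = seq.find(subsequence)
        while start != -1:
            nxt = start + len(subsequence)
            if nxt < len(seq):  # an occurrence at the very end has no follower
                followers.add(seq[nxt])
            start = seq.find(subsequence, start + 1)
    return len(followers)


def is_characteristic(database, subsequence, alphabet):
    """A subsequence is characteristic if its combinatory number is below the alphabet size."""
    return combinatory_number(database, subsequence) < len(alphabet)


# Hypothetical toy database over a four-unit alphabet.
ALPHABET = "ABCD"
DATABASE = ["ABCA", "ABCD", "CABCA", "ABD"]

print(combinatory_number(DATABASE, "ABC"))           # followers are A and D
print(is_characteristic(DATABASE, "ABC", ALPHABET))  # 2 is lower than 4
```

Under this literal reading, a subsequence that only ever occurs at the end of a sequence has a combinatory number of zero and would also count as characteristic. A practical implementation would probably need to handle that edge case explicitly.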

Description

Technical field

[0001] The present invention relates to the processing of biological information, and more particularly to retrieving and/or correlating said biological information.

Background

[0002] Biological sequencing has advanced at an astonishing pace over the past few decades, making possible the Human Genome Project, which achieved the complete sequencing of the human genome more than 15 years ago. Driving this development required numerous technological advances, from sample preparation and sequencing methods to data acquisition, processing, and analysis. At the same time, new scientific fields have emerged and developed, including genomics, proteomics, and bioinformatics.

[0003] Driven by the emphasis on data acquisition in the post-genomic era, this development has led to the accumulation of large amounts of biological (e.g., sequence) data. However, the ability to organize, analyze and interpret this sequence to extract biologica...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G16B30/20
CPC: G16B30/20, G16B50/50, G16B30/10, G16B15/00, G16B20/20
Inventor: D·范海夫特, A·范海夫特, I·布兰兹, E·范海夫特
Owner: 生物海滩公司