Bioinformatics data processing systems

A data and physical technology, applied in the field of map comparison, can solve the problems of unclear map and sequence aligner, expansion is not straightforward, simple, different, etc., to achieve the effect of improving sensitivity and runtime, and reducing costs

Inactive Publication Date: 2018-01-02
AGENCY FOR SCI TECH & RES
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since maps represent ordered lists of continuous values, this extension is not straightforward, especially when considering multiple sources of map drawing errors and their high error rates (Muggli MD. et al., Efficient Indexed Alignment of Contigs to Optical Maps, In Algorithms in Bioinformatics (WABI 2014), LNCS Volume 8701, Wroclaw (Wroclaw, Poland: Springer; 2014p.68-81)
Also, since error rates can vary across technologies, and even across different runs on the same instrument, it is not clear that a generally sensitive map-to-sequence aligner is feasible

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bioinformatics data processing systems
  • Bioinformatics data processing systems
  • Bioinformatics data processing systems

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0074] Detailed description of the preferred embodiment

[0075] Describes a novel seeded and extended global-local (glocal, short for global-local) alignment method, OPTIMA (and a sliding window extension for overlapping alignments, OPTIMA-Overlap), which allows for continuous-valued Mapping data is indexed, taking into account mapping errors. Also presented is a novel statistical model for conservatively assessing the significance of alignments without relying on expensive permutation-based tests, which is agnostic with respect to technique-dependent error rates.

[0076] As will be shown, OPTIMA and OPTIMA-Overlap are advantageous over prior art approaches because OPTIMA and OPTIMA-Overlap are more sensitive (1.6–2 times sensitive), more efficient (170–200%), and in terms of their alignment More precise (almost 99% accurate). These advantages are independent of the quality of the data, showing that the indexing approach and statistical evaluation of the present invention ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Disclosed is a computer-implemented method of determining at least one optimal alignment of at least part of a first map to at least part of a second map or a plurality of second maps, wherein the maps are physical genome maps and / or restriction maps. The method comprises: receiving first map data indicative of a first ordered list of distances between features of the first map, receiving second map data indicative of a second ordered list of distances between features of the second map or second maps; generating, from the second map data, seed data indicative of a plurality of seeds, each seed comprising at least one of the distances in the second ordered list, wherein the features are restriction sites and distances are fragment sizes. The said method further comprises generating a plurality of candidate alignments from the seed data by searching at least part of the first ordered list to find at least approximate matches for respective seeds, and extending the approximate matches bydynamic programming; determining respective alignment scores for respective candidate alignments; and selecting one or more of the candidate alignments as an optimal alignment or optimal alignments,based on the alignment scores.

Description

field of invention [0001] The present invention relates generally to methods and systems for map comparisons, particularly map-to-sequence (map-to-sequence) alignments, and more particularly to methods and systems for determining the relationship between at least a portion of a first map (e.g., a first physical genome map) and Methods and systems for at least one optimal alignment of at least a portion of a second profile or multiple second profiles (eg, a second physical genomic profile). Background of the invention [0002] Resolving complex repeat structures and rearrangements in the assembly and analysis of large eukaryotic genomes typically resorts to a combination of high-throughput sequencing and genome mapping techniques (eg, optical restriction mapping). Specifically, mapping techniques are able to generate sparse maps of large DNA fragments (150 kilobase pairs (kbp) to 2 Mbp) and thus provide a unique source of information for distinguishing complex rearrangements ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/22G16B30/10
CPCG16B30/00G16B30/10
Inventor D·韦尔佐托N·纳加拉贾
Owner AGENCY FOR SCI TECH & RES
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products