Query method for XML (Extensive Makeup Language) data

A query method and data technology, applied in the database field, can solve the problem of high IO cost and achieve efficient and effective query and management

Inactive Publication Date: 2014-01-01
PEKING UNIV
View PDF3 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The TurboXpath function is relatively sound, but for the XML data in the database, the TurboXPath algorithm scans the XML document from the beginning to the end, and the IO cost is very high, especially for those larger XML documents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Query method for XML (Extensive Makeup Language) data
  • Query method for XML (Extensive Makeup Language) data
  • Query method for XML (Extensive Makeup Language) data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The present invention will be described in detail below through specific embodiments and accompanying drawings.

[0032] figure 1 It is a flowchart of the XML data query method of the present invention, and the specific steps include:

[0033] 1) Use Native XML to store XML data in the database.

[0034] The XML data query method of the present invention belongs to the whole twig connection method. Compared with the earlier structured connection, the whole twig connection technology can avoid a large number of invalid intermediate results. The basis of the method of the invention is Native XML storage, and adopts DDE encoding mode for XML elements. The Native storage mechanism maintains the document order of XML elements, and the sub-documents rooted at an element can be retrieved through the physical address of an element's tag. DDE encoding is used to judge the common structural relationship of XML elements (ancestral descendants, father and son, brothers, etc.). ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a query method for XML (Extensive Makeup Language) data. The query method comprises the following steps: (1) adopting a Native XML manner to store the XML data, wherein a storage structure comprises an inner node layer, a leaf node layer and an inverted layer, wherein the inner node layer is used for storing nodes of an XML tree, an XML element is encoded by adopting a DDE (Dynamic Data Exchange) encoding manner, the leaf node layer is used for storing file data of the nodes of the XML tree; the inverted layer is used for storing an inverted index of the leaf node layer; (2) taking out an element sequence corresponding to the nodes of an XPath from the inverted layer according to an input XPath query statement and carrying out merge sort by adopting a loser tree; and (3) carrying out pushing and popping operation on XML elements subjected to the merge sort according to the sequence and obtaining a query result from a buffering region. According to the query method for the XML data, the XPath with a keyword OR and a wildcard character * can be processed and the efficiency is very high.

Description

technical field [0001] The invention belongs to the technical field of databases, and relates to a storage and query method for semi-structured data XML, in particular to an XML data query method that can effectively support XML query language XPath. Background technique [0002] As more and more application systems adopt XML as a standard format to publish and exchange data, the scale of XML data has expanded dramatically. According to a report recently released by IDC (Internet Data Center), among the IT departments of 500 interviewed companies 29% are heavily using XML documents and XML databases. How to effectively manage XML data has become an urgent problem to be solved. [0003] Quickly and accurately find all matching elements of XPath in XML database, which is the core operation of XML query processing. For example, an XPath expression: book[title='XML'] / / author[fn='Jane'AND ln='Doe'], the node author matched by this expression needs to satisfy: 1) There is a chil...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/81G06F16/835
Inventor 郭少松包小源陈薇王腾蛟杨冬青
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products