XPath query optimization method and system

A query optimization and optimal technology, applied in the database field, can solve problems such as difficulty in establishing distribution assumptions, difficulty in structure storage and calculation, and restricting the flexibility of conversion rules, and achieve the effect of improving execution efficiency.

Active Publication Date: 2013-02-13
SOUTH CHINA UNIV OF TECH
View PDF2 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The irregularity of XML data is an important challenge to traditional statistical information methods. Its data distribution makes some traditional distribution assumptions difficult to establish, and the complexity of the structure brings storage and calculation difficulties to obtain relatively accurate statistical information. , the orderliness of XML data also restricts the flexibility of conversion rules
All these problems make it impractical to use the traditional cost estimation method in xml, which will bring large errors

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • XPath query optimization method and system
  • XPath query optimization method and system
  • XPath query optimization method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0051] Firstly, document statistical information to be used in the XPath query optimization method of the embodiment of the present invention is introduced. Document statistics are used to estimate the selectivity of the result set of the path expression and the predicate path in the process of query optimization, and the estimated results will be used to estimate the cost of the connection operation, so as to select the optimal query for the query optimization method basis for the plan. There are two types of document statistics: structure statistics and value statistics. The structure statistics will be introduced first.

[0052] The statistical information of the document structure used in the XPath query optimization method of the embodiment of the present invention is hierarchical coding. Hierarchical coding is a breadth-first coding form, which is expressed as a binary string, defined as follows:

[0053] 1. For the root node of the XML document, define its level...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an XPath query optimization method and system. The method comprises the following steps of: counting structural summary information of an extensive makeup language (XML) document through hierarchical encoding; counting value summary information by using a value-coding histogram and RPST, and performing an optimization algorithm for query optimization on an XPath expression by utilizing the statistical information, wherein the query optimization algorithm comprises the following steps of: 101-102, initializing a data structure and processing a single-step path; 103, judging whether a non-estimated path exists; 104, judging the path type; 105-109, estimating connection with lowest cost in all possible connection of a long path, and updating a cost matrix and a result set matrix by using corresponding data; 110-114, estimating an arrangement sequence with lowest path in a predicate path, updating a cost matrix and a result set matrix by using the corresponding data, and resorting the predicates according to an optimized sequence; and 115, reconstructing a query plan. According to the XPath query optimization method and system, the XPath query sentences can be effectively optimized, and the execution efficiency of the XPath query sentences is greatly improved.

Description

technical field [0001] The invention relates to the technical field of databases, in particular to an XPath query optimization method and system. Background technique [0002] In recent years, more and more data are described by XML and transmitted and exchanged on the network. The rapid expansion of XML data has brought new problems to the field of computer science: how to effectively store and quickly retrieve XML data. To store and query massive XML data on the Internet in the form of a database is a mainstream idea about XML data processing at present. Query processing is one of the essential and most important functions of any database system. As an important part of query processing, query optimization technology is often the key factor affecting query efficiency. Due to the complexity of the XML data model and its increasing scale, as well as the complexity of the XML query itself, the performance of the XML query is often not ideal. People have used quite mature q...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 李东梁晓翀
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products