XML keyword query method

A keyword query method, applied in the field of information retrieval, that addresses problems such as incomplete semantics of returned results, loss of meaningful results, and unsupported result sorting. It ensures semantic integrity, reduces time complexity, and ensures query accuracy.

Active Publication Date: 2014-11-26
HOHAI UNIV

AI Technical Summary

Problems solved by technology

[0005] Most of these semantics suffer from problems such as incomplete semantics in the returned results, many meaningless results, missing meaningful results, or no support for result sorting, which leads to low query quality.


Embodiment Construction

[0043] The technical solution of the invention will be described in detail below in conjunction with the accompanying drawings.

[0044] As shown in Figure 1, the XML keyword query method proposed by the present invention comprises the following steps:

[0045] Step 1, according to the BLCEA query semantics, find the BLCEA node set of the query keyword sequence Q = (k_1, k_2, ..., k_n), where n is a natural number:

[0046] Step 1-1, initialize the BLCEA node set to be empty;

[0047] Step 1-2, obtain the ordered LDewey code set of all nodes matching each keyword. For the XML document in Figure 2 and the query keyword sequence Q = {title, XML, 2013}, the resulting ordered LDewey code sets of the matching nodes are shown in Figure 3;
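Step 1-2 can be illustrated with a minimal sketch. The excerpt does not define the LDewey code in detail, so the sketch assumes a plain Dewey code stored as a tuple of child positions, with the node level taken as the length of the tuple. The sample document, the function name `build_index`, and the matching rule (a node matches its own tag name and the words in its direct text) are all hypothetical and are not taken from Figure 2 or Figure 3.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample document; it is NOT the document from Figure 2.
SAMPLE = """<bib>
  <book><title>XML basics</title><year>2013</year></book>
  <book><title>Databases</title><year>2012</year></book>
</bib>"""

def build_index(xml_text):
    """Map each lowercase term to a sorted list of Dewey codes (tuples).

    Assumption: the level of a node equals len(code), with the root at level 1.
    """
    root = ET.fromstring(xml_text)
    index = defaultdict(set)

    def visit(node, dewey):
        # Assumed matching rule: tag name plus words of the node's direct text.
        terms = {node.tag.lower()}
        terms.update((node.text or "").lower().split())
        for term in terms:
            index[term].add(dewey)
        # Children are numbered from 1 in document order.
        for pos, child in enumerate(node, start=1):
            visit(child, dewey + (pos,))

    visit(root, (1,))
    # Tuple ordering is lexicographic, which coincides with document order.
    return {term: sorted(codes) for term, codes in index.items()}

index = build_index(SAMPLE)
for keyword in ("title", "xml", "2013"):
    print(keyword, index.get(keyword, []))
```

Sorting the tuples keeps each keyword's list in document order, which is the property an ordered code set needs for a merge-style scan.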

[0048] Step 1-3, first find the highest possible level of the BLCEA nodes of the query keyword sequence Q. The highest level of matching nodes in the inverted index table of the keyword title is 5, and the matching node in the inverted index tabl...
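The excerpt is cut off before it states how this level is derived, so the following is only one reading that is consistent with tree structure, not the patent's stated procedure. A node whose subtree contains every keyword must be an ancestor-or-self of at least one match per keyword, so its level cannot exceed the smallest of the per-keyword deepest match levels. The function name `highest_possible_level`, the toy index, and the convention that a code's level is its length are assumptions.

```python
def highest_possible_level(index, keywords):
    """Upper bound on the level of any node covering all keywords.

    index maps a keyword to a list of Dewey codes (tuples); the level of a
    code is assumed to be its length. Returns 0 if some keyword has no match.
    """
    deepest_per_keyword = []
    for keyword in keywords:
        codes = index.get(keyword, [])
        if not codes:
            return 0  # no node can cover a keyword that never occurs
        deepest_per_keyword.append(max(len(code) for code in codes))
    # The result node cannot be deeper than the shallowest of these maxima.
    return min(deepest_per_keyword)

# Hypothetical index; the codes are illustrative and not taken from Figure 3.
toy_index = {
    "title": [(1, 1, 1), (1, 2, 1)],
    "xml": [(1, 1, 1, 1)],
    "2013": [(1, 1, 2)],
}
print(highest_possible_level(toy_index, ["title", "xml", "2013"]))
```

Starting the search from this bound avoids examining levels that cannot contain a node covering every keyword.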


Abstract

The invention discloses an XML keyword query method in the field of information retrieval, comprising the BLCEA query semantics, an algorithm for computing them, and a method for ranking the returned results. Under the BLCEA query semantics, the nodes of an XML document are first classified; an entity node that still satisfies the query conditions after its entity descendants containing all query keywords are excluded is defined as a meaningful BLCEA semantic entity, which guarantees the recall and semantic integrity of the query results. The ranking method combines the matching degree and the compactness of the keywords within the subtree rooted at each result node. The method lowers the time complexity of XML keyword query and preserves query precision when the keywords are ambiguous.

Description

Technical field

[0001] The invention relates to an XML keyword query method and belongs to the field of information retrieval.

Background art

[0002] Owing to its extensibility, flexibility and self-describing nature, XML has gradually become the standard for data definition, storage and exchange on the Internet, so how to store, manage and retrieve XML data effectively has become a research hotspot. Existing XML query methods fall into two types: structured query and keyword query. The former requires users to understand the syntax of a structured query language and the schema of the XML documents, which makes it unsuitable for ordinary users. The latter only requires the user to enter query keywords to retrieve XML documents, and it has become the main means of XML retrieval.

[0003] Currently, keyword query methods are mainly divided into two categories: those that do not support result sorting and those that support result sortin...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
CPC: G06F16/8373
Inventor: 冯钧, 朱祖会, 唐志贤, 许潇, 杜丙帅, 査显月, 王纯, 李宗祥, 魏童童, 朱跃龙, 李士进, 万定生
Owner: HOHAI UNIV