Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Keyword-Based XML Retrieval and Sorting Processing System and Method

A technology of retrieval sorting and processing system, applied in the field of information retrieval and XML data management, can solve the problem of returning result granularity, bad, unable to rule out the problem of returning result granularity, etc.

Active Publication Date: 2019-11-08
BEIJING INSTITUTE OF GRAPHIC COMMUNICATION
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] At present, the existing XML retrieval methods are generally based on SLCA semantics. Based on SLCA semantics, the results will be lost, mainly because the granularity of the results is too large or too small, and the query results of single keywords are not good.
The fewer keywords, the more prone to results granularity and return of meaningless information in the query results of SLCA semantics
The existing XML query result sorting methods are based on SLCA semantics, so the result granularity is quite different. Generally, only the structural relationship between keyword nodes and SLCA nodes will be considered, and the relationship between keyword nodes will not be considered in the case of fixed-range granularity. The relationship between the relationship and the level of the returned results, the returned results are relatively inaccurate, and the granularity of the returned results cannot be ruled out

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Keyword-Based XML Retrieval and Sorting Processing System and Method
  • A Keyword-Based XML Retrieval and Sorting Processing System and Method
  • A Keyword-Based XML Retrieval and Sorting Processing System and Method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] In order to better understand the present invention, some basic concepts are first explained:

[0065] XML data tree: The information contained in XML data can be divided into two parts, one is content, which is composed of text text and attribute values ​​contained in XML data, and the other is structural information, which is composed of the nesting relationship between tags of XML data. Therefore, the XML document to be queried can be abstracted into an XML data tree model, G=(V, E, r, A), where V represents the set of all nodes in G, E represents the set of all edges in G, and r represents the The root node, A is the set of labels of all nodes.

[0066] Node number path: path(v) is a node number path function, which is used to obtain the node number path of node v from root node r to v, and the number path of root node r is 1. The set of node numbers on the path from the target node to the root node is the node number path.

[0067] Fragment numbering path: Assumi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an XML retrieval sorting processing system and method based on keywords. The XML retrieval sorting processing system comprises six modules including a basic semantic node screening module, a document semantic analysis module, a keyword retrieval module, a unit structure analysis module, a semantic weighing module and a result screening module. By the adoption of the XML retrieval sorting processing system and method based on the keywords, the defects are overcome that because the granularity of results returned through a keyword query method based on SLCA structural keywords is inappropriate, irrelevant results with overlarge granularities are easily returned and returned results of a single keyword are bad; semantic fragments which is moderate in granularity and relatively independent and relatively accord with users' intentions. By sorting the XML results, users can select to return a fixed number of query results, the query accuracy is further improved, andusers' preferences are well met.

Description

technical field [0001] The invention relates to a keyword-based XML retrieval and sorting processing system and method, which belongs to the field of information retrieval and XML data management, and is mainly used for querying XML databases and performing data processing on query results. Background technique [0002] The keyword retrieval method provides a user-friendly and convenient way to retrieve XML documents. Users only need to submit keywords without mastering data schema information or learning complex query syntax. For unstructured data such as web pages and office documents on the WWW, search engines such as Google, Baidu, and Bing provide keyword retrieval functions, which can help users quickly obtain the required information from massive web data, and have become the most popular among ordinary users. The welcome way to get information on the Internet. XML has become the standard format for expressing and exchanging data on the Web, and a large amount of XML...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/81G06F16/33G06F17/27
Inventor 解凯叶宇姍曾庆涛河南南
Owner BEIJING INSTITUTE OF GRAPHIC COMMUNICATION
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products