
Abstract generating method of extensible markup language (XML) keyword search

A keyword and abstract technology, applied in the field of abstract generation for XML keyword retrieval, can solve problems such as inability to evaluate

Status: Inactive
Publication Date: 2011-04-06
PEKING UNIV
Cites: 1; cited by: 11

AI Technical Summary

Problems solved by technology

The XSeek model does not give a quantitative formula for each evaluation criterion, so it cannot accurately evaluate how well an abstract meets each criterion.



Examples


Embodiment Construction

[0055] A typical XML document describes an entity (such as a person, a company, or a country), and the description generally consists of the entity's attributes. For a country, for example, the attributes include the country name, geographic location, population, and so on. The importance of an attribute is measured with respect to a specific entity. A person, for example, has many attributes: name, birth date, address, age, nationality, etc. These attributes contribute to describing the person to different degrees: a name is generally enough to identify a person, while the other attributes are not. In the MRepA model of the present invention, W(e, a) denotes the weight of attribute a (the attribute name together with its value, i.e. attribute node + value node) in describing entity e (the entity node):

[0056] W(e, a) = (Dist(a) · Expl(a, Q)) Corr(e, a)

[0057] Among them, Dist(a) is used to measure th...
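The weight formula in [0056] can be illustrated with a minimal Python sketch. The extracted text leaves it unclear whether Corr(e, a) multiplies the product Dist(a) · Expl(a, Q) or was originally typeset as an exponent, so the sketch defaults to the multiplicative reading and offers the exponent reading as an option. The `AttributeScores` class, the `weight` function, and the sample scores are hypothetical names and values chosen for illustration; how the three component scores are computed is not shown here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AttributeScores:
    """Hypothetical container for the three MRepA component scores of one attribute."""
    dist: float  # Dist(a): distinguishing power of attribute a
    expl: float  # Expl(a, Q): how explicit attribute a is for query Q
    corr: float  # Corr(e, a): how closely attribute a relates to entity e


def weight(scores: AttributeScores, corr_as_exponent: bool = False) -> float:
    """Combine the component scores into W(e, a).

    Default (assumption): W = (Dist * Expl) * Corr, reading the juxtaposition
    in the extracted formula as a product.
    Alternative (assumption): W = (Dist * Expl) ** Corr, in case a superscript
    was lost during text extraction.
    """
    base = scores.dist * scores.expl
    if corr_as_exponent:
        return base ** scores.corr
    return base * scores.corr


# Made-up scores, used only to show the two readings side by side.
sample = AttributeScores(dist=0.5, expl=0.5, corr=0.5)
print(weight(sample))                         # 0.125 (product reading)
print(weight(sample, corr_as_exponent=True))  # 0.5 (exponent reading)
```

Keeping the combination rule behind a single function makes it easy to switch readings once the original typeset formula has been checked.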



Abstract

The invention provides a method for generating abstracts for extensible markup language (XML) keyword search, together with a model for evaluating the importance of abstract content. The model contains three evaluation elements: differentiation, definiteness, and relevance. Differentiation, Dist(a), measures the distinguishing power of an attribute a; definiteness, Expl(a, Q), measures how explicit attribute a is with respect to query Q; and relevance, Corr(e, a), measures how closely attribute a relates to entity e. The method uses the model to quantify the importance of abstract information in XML keyword search with the formula W(e, a) = (Dist(a) · Expl(a, Q)) Corr(e, a), and then selects the top-K most important attributes as the abstract describing the entity. The invention addresses the lack of a quantitative measure of information importance in traditional XML keyword search.
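The top-K selection step described in the abstract can be sketched as follows, assuming the weights W(e, a) have already been computed for every attribute of an entity. The function name `top_k_attributes` and the sample weights are hypothetical; the source does not say how ties are broken or how K is chosen.

```python
import heapq
from typing import Dict, List, Tuple


def top_k_attributes(weights: Dict[str, float], k: int) -> List[Tuple[str, float]]:
    """Return the k attributes with the largest W(e, a), highest weight first."""
    if k <= 0:
        return []
    return heapq.nlargest(k, weights.items(), key=lambda item: item[1])


# Hypothetical weights for a "person" entity; the numbers are made up.
person_weights = {
    "name": 0.9,
    "birth_date": 0.4,
    "address": 0.3,
    "age": 0.2,
    "nationality": 0.1,
}
print(top_k_attributes(person_weights, k=2))  # [('name', 0.9), ('birth_date', 0.4)]
```

`heapq.nlargest` avoids sorting the full attribute list when K is small relative to the number of attributes.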

Description

Technical field

[0001] The present invention relates to XML retrieval technology, and in particular to a method for generating abstracts for XML keyword retrieval. It can be applied to XML keyword search engines and to keyword search engines over other structured or semi-structured data.

Background art

[0002] Since XML's introduction in 1998, XML documents have come to be widely used on the Internet, in databases, and in other fields thanks to their openness, self-description, and simplicity, and XML has become the standard language for data exchange and integration on the Internet. With the emergence of large numbers of XML documents, how to quickly find information that meets user needs in large-scale XML collections has become a research hotspot in information retrieval and databases. A specific XML file is shown in Figure 1; Figure 2 is the tree structure corresponding to the XML document shown in Figure 1.

[0003] XML information retrieval can be divided into two categories: ...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8) G06F17/30
Inventors: 邓志鸿 (Deng Zhihong), 江家健 (Jiang Jiajian)
Owner: PEKING UNIV