Metadata schema matching method based on XML (extensive markup language) document

A pattern matching and metadata technology, applied in the database field, can solve problems such as ignoring metadata similarity relationship, ignoring metadata semantic similarity, instance similarity relationship similarity, etc.

Active Publication Date: 2013-03-20
INFORMATION & COMM BRANCH OF STATE GRID JIANGSU ELECTRIC POWER +3
View PDF4 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing work is facing a new problem with the purpose of enriching the semantic information of metadata schemas and the task of merging and constructing multi-source heterogeneous metadata schemas.
At present, the metadata pattern matching algorithms mainly include the logical structure matching algorithm based on regular expression rules and the metadata matching algorithm of XML documents based on the hidden Markov model, but the logical structure matching algorithm based on regular expression rules mainly considers XML documents. The logical structure similarity between the metadata, ignoring the metadata semantic similarity, instance similarity and relationship similarity and other factors, and the metadata matching algorithm based on the hidden Markov model mainly extracts part of the metadata in the header of the XML document. Data information, ignoring the similarity relationship of other metadata

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Metadata schema matching method based on XML (extensive markup language) document
  • Metadata schema matching method based on XML (extensive markup language) document
  • Metadata schema matching method based on XML (extensive markup language) document

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Combine below figure 1 This application is further described.

[0036] (1) Calculate the semantic similarity of two metadata

[0037] Computing the semantic similarity of metadata mainly uses the similarity of word formation to find the similarity between concepts. The semantic similarity reflects the similarity of two metadata in linguistics. Since the metadata m 1 and m 2 The names of are all represented by strings, so the metadata m can be measured according to the synonym matching of strings 1 and m 2 similarity between.

[0038] (2) Calculate the attribute similarity of two metadata

[0039] The attribute of metadata consists of two parts: one part is the attribute name, which reflects the content of the attribute, and the other part is the attribute type, which limits the value range of the attribute's parameters. Calculate metadata m 1 and m 2 The attribute similarity needs to comprehensively consider the number of data attribute intersections of binary d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a metadata schema matching method based on an XML (extensive markup language) document, which comprises the steps of calculating the semantic similarity, the attribute similarity, the instance similarity, the structural similarity and the relation similarity of two metadata, setting weight in accordance with the specific XML document, and finally, calculating the comprehensive similarity. The metadata schema matching method based on the XML document provided by the invention has the advantage that the calculating result is more accurate during matching of the metadata schema.

Description

technical field [0001] The invention relates to metadata pattern matching, in particular to a method for calculating semantics, attributes, instances, structures and relational similarities of metadata of XML documents, and belongs to the technical field of databases. Background technique [0002] Extensible Markup Language (eXtensible Markup Language, XML) is a set of rules for defining semantic markup, through which users can create a document type definition (Document Type Definition, referred to as DTD) rule set, XML as a unified conversion syntax and exchange format, for Developers and users provide a standard way to exchange metadata information, so that metadata can be exchanged conveniently and concisely between OMG UML-based modeling tools and OMG MOF-based Metadata Repository. Metadata is data about data, which is used to describe information about the content, coverage, quality, management method, owner of the data, how the data is provided, etc. of a feature, dat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 朱晓燕何金陵潘留兴赵鑫
Owner INFORMATION & COMM BRANCH OF STATE GRID JIANGSU ELECTRIC POWER
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products