Method for improving accuracy of Web data semantic annotation

This semantic annotation technology, applied in the field of Web pages, addresses the lack of methods that comprehensively use both existing Web database information and the logical relationships between Web data elements, and thereby improves annotation performance.

Inactive Publication Date: 2015-04-29
董永权

AI Technical Summary

Problems solved by technology

[0009] To sum up, no existing method comprehensively uses both the existing Web database information and the logical relationships between Web data elements in the semantic annotation of Web data.

Examples

Embodiment Construction

[0027] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0028] A method for improving the accuracy of Web data semantic labeling in the present invention is specifically carried out according to the following steps:

[0029] Step 1,

[0030] Extend the traditional CRF model by introducing credibility constraints and logical constraints into it;

[0031] Step 2,

[0032] Adopt an integer linear programming inference method to introduce the credibility constraints and the logical constraints into the inference process at the same time, which significantly improves the performance of semantic labeling of Web data.
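The two steps can be illustrated with a small sketch. It treats inference as a 0/1 assignment problem: each data element receives exactly one label, the objective adds CRF-style node and transition scores to weighted credibility scores, and assignments that violate a logical constraint are excluded. All scores, the label set, the `weight` parameter, and the "each label at most once" rule are assumptions made for illustration and do not come from the patent. Exhaustive enumeration stands in for a real integer linear programming solver, so this is only practical for very short sequences.

```python
from itertools import product

LABELS = ["title", "author", "price"]

# Hypothetical scores; none of these numbers come from the patent.
# node_score[i][label]: CRF-style score of giving `label` to element i.
node_score = [
    {"title": 2.0, "author": 1.5, "price": 0.1},
    {"title": 3.0, "author": 1.0, "price": 0.2},
    {"title": 0.1, "author": 0.3, "price": 2.5},
]
# credibility[i][label]: confidence from a label classifier (assumed values).
credibility = [
    {"title": 0.9, "author": 0.1, "price": 0.0},
    {"title": 0.3, "author": 0.7, "price": 0.0},
    {"title": 0.0, "author": 0.0, "price": 1.0},
]
# transition_score[(prev, cur)]: CRF-style score for adjacent labels.
transition_score = {("title", "author"): 1.0, ("author", "price"): 0.5}


def satisfies_logic(labels):
    """Assumed logical constraint: each label occurs at most once per object."""
    return len(set(labels)) == len(labels)


def objective(labels, weight=1.0):
    """Sum node, weighted credibility, and transition scores for a labeling."""
    total = 0.0
    for i, label in enumerate(labels):
        total += node_score[i][label] + weight * credibility[i][label]
    for prev, cur in zip(labels, labels[1:]):
        total += transition_score.get((prev, cur), 0.0)
    return total


def infer(n_elements, use_logic=True):
    """Return the best-scoring labeling, optionally enforcing the logic rule.

    Enumerating label tuples is equivalent to enumerating 0/1 indicator
    variables under the "exactly one label per element" constraint.
    """
    best, best_score = None, float("-inf")
    for labels in product(LABELS, repeat=n_elements):
        if use_logic and not satisfies_logic(labels):
            continue
        score = objective(labels)
        if score > best_score:
            best, best_score = labels, score
    return best, best_score


if __name__ == "__main__":
    print(infer(3, use_logic=False))  # unconstrained labeling
    print(infer(3, use_logic=True))   # labeling with the logical constraint
```

With these made-up numbers, the unconstrained search labels the first two elements both as `title`, while the constrained search returns `title`, `author`, `price`. This shows how a logical constraint can override locally attractive but inconsistent labels.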

[0033] The credibility constraint refers to the credibility of different labels for each data element in the Web data object, which is obtained by using the existing Web database information to construct a label classifier. Logical constraints refer to the logical relationship between d...
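As a rough illustration of how label credibility might be derived from existing Web database information, the sketch below trains a tiny token-based classifier on known values for each label and returns a normalized score per label for a new data element. The patent text shown here does not say which classifier is used; the naive Bayes style scoring with add-one smoothing, the `KNOWN_VALUES` records, and the function names `train` and `credibility` are all assumptions chosen for illustration.

```python
from collections import Counter

# Hypothetical records standing in for an existing Web database.
KNOWN_VALUES = {
    "title": ["Learning Python", "Deep Learning", "Python Cookbook"],
    "author": ["Mark Lutz", "Ian Goodfellow", "David Beazley"],
}


def train(known_values):
    """Count how often each lowercase token occurs under each label."""
    return {
        label: Counter(tok for value in values for tok in value.lower().split())
        for label, values in known_values.items()
    }


def credibility(element, model):
    """Return a normalized score per label for one data element.

    Uses add-one smoothing so that an unseen token does not zero out a label.
    """
    tokens = element.lower().split()
    raw = {}
    for label, counts in model.items():
        total = sum(counts.values())
        vocab = len(counts)
        score = 1.0
        for tok in tokens:
            score *= (counts[tok] + 1) / (total + vocab)
        raw[label] = score
    norm = sum(raw.values())
    return {label: s / norm for label, s in raw.items()}


if __name__ == "__main__":
    model = train(KNOWN_VALUES)
    print(credibility("Python Cookbook", model))
    print(credibility("Mark Lutz", model))
```

Scores of this kind could then feed the inference step as per-element, per-label credibility values. A production classifier would likely use richer features than raw tokens.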

Abstract

The invention discloses a method for improving the accuracy of Web data semantic annotation, comprising the following steps: extending the traditional CRF model by introducing a credibility constraint and a logical constraint into it; and introducing the credibility constraint and the logical constraint into the inference process simultaneously by adopting an integer linear programming inference method, so that the performance of Web data semantic annotation is significantly improved. Test results on real data sets from several fields show that the model significantly improves the performance of Web data semantic annotation and lays a good foundation for Web information extraction.

Description

Technical field

[0001] The invention belongs to the technical field of Web pages and relates to a method for improving the accuracy of semantic labeling of Web data.

Background

[0002] With the continuous development of the WWW, a large amount of valuable information covering various fields has been stored in Web pages. Web data objects are semi-structured data objects organized from multiple data elements and optional semantic tags according to specific patterns. Accurately semantically annotating the Web data objects extracted from HTML pages, that is, assigning a meaningful label to each extracted data element to represent its semantics, provides the necessary basis for Web data integration.

[0003] Research shows that similar Web data objects on different websites exhibit strong sequential regularity. For example, on mainstream online bookselling websites, the title of the book usually precedes the description information o...

Application Information

IPC(8): G06F17/30
CPC: G06F16/313
Inventor: 董永权
Owner: 董永权