Web temporal object model-based outdated webpage information automatic discovering method

An object model and automatic discovery technology, applied in the field of data quality research, that addresses problems such as low data quality and weak handling of temporal inconsistency

Inactive Publication Date: 2012-10-17
WUHAN UNIV
3 Cites, 7 Cited by

AI Technical Summary

Problems solved by technology

[0029] Because Web temporal inconsistency lowers data quality and current related technologies handle such problems poorly, the present invention aims to establish a Web temporal object model in order to automatically obtain the temporal information extraction paths of web pages, learn temporal information extraction rules, construct con ...



Examples


Embodiment Construction

[0063] In a specific implementation, those skilled in the art can realize the technical solution provided by the present invention as an automated process using computer software. The technical solution of the present invention is described in detail below with reference to the drawings and embodiments.

[0064] Figure 1 shows the implementation scheme of the embodiment of the present invention, which is divided into the following processes. First, Web temporal analysis establishes a Web temporal object model (Ttree) that reflects the different temporal sensitivities of Web information. Next, temporal consistency constraints and reasoning are applied to the established Web temporal object model. Finally, temporal inconsistencies are classified and measured, so that temporal inconsistencies in Web information are discovered automatically and the web pages are checked.
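The three processes can be illustrated with a minimal Python sketch. It is not the patented Ttree: the node fields (`stated`, `max_age`), the class name `TNode`, and the rule "a node is outdated when its stated date is older than its sensitivity window" are all assumptions made here for illustration, since the source does not specify the model's structure or its constraints.

```python
from dataclasses import dataclass, field
from datetime import date, timedelta
from typing import Iterator, List, Optional


@dataclass
class TNode:
    """One node of a hypothetical Web temporal object tree (illustrative only)."""
    label: str
    stated: Optional[date] = None        # date extracted from the page, if any
    max_age: Optional[timedelta] = None  # assumed temporal sensitivity window
    children: List["TNode"] = field(default_factory=list)

    def walk(self) -> Iterator["TNode"]:
        """Yield this node and all descendants, depth first."""
        yield self
        for child in self.children:
            yield from child.walk()


def find_outdated(root: TNode, today: date) -> List[str]:
    """Return labels of nodes whose stated date is older than their window."""
    return [
        node.label
        for node in root.walk()
        if node.stated is not None
        and node.max_age is not None
        and today - node.stated > node.max_age
    ]


# A page with a highly time-sensitive item and a weakly time-sensitive one.
page = TNode("page", children=[
    TNode("news item", date(2012, 1, 5), timedelta(days=30)),
    TNode("contact address", date(2011, 6, 1), timedelta(days=730)),
])
outdated = find_outdated(page, today=date(2012, 10, 17))
print(outdated)  # ['news item']
```

Giving each node its own window is one simple way to express "different temporal sensitivities": the same age can make a news item stale while leaving a contact address acceptable.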

[0065] The specific implementation steps ...


Abstract

The invention discloses a method for automatically discovering outdated webpage information based on a Web temporal object model. It belongs to the field of data quality research and relates to the technical fields of the temporal Web, network information quality assessment, semantic understanding and extraction of temporal information, constraint and reasoning over temporal information, automatic screening of webpage information consistency, and the like. To address the low Web data quality caused by temporal inconsistency, a Web temporal object model is established on the basis of the different temporal sensitivities of different web pages; temporal consistency constraint relations, a reasoning mechanism, and algebraic operation rules for network data are constructed; outdated information in web pages is automatically screened and flagged; and timely, relevant information is provided to network users. The method can be widely applied to tasks such as quality ranking of similar websites and time-aware search ranking, and it improves Web information quality to a certain extent.
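As a rough illustration of the "quality ranking of similar websites" application, the Python sketch below orders sites by the share of their checked items that are not outdated. The scoring formula, the function names `freshness` and `rank_sites`, and the example counts are assumptions for illustration; the source does not state how the ranking is computed.

```python
from typing import Dict, List, Tuple


def freshness(outdated: int, total: int) -> float:
    """Share of checked items still current (assumed 1.0 when nothing was checked)."""
    return 1.0 if total == 0 else 1.0 - outdated / total


def rank_sites(counts: Dict[str, Tuple[int, int]]) -> List[str]:
    """Order site names from freshest to stalest; ties break alphabetically."""
    return sorted(counts, key=lambda site: (-freshness(*counts[site]), site))


# Hypothetical (outdated, total) item counts per site.
counts = {
    "site-a.example": (4, 10),
    "site-b.example": (1, 10),
    "site-c.example": (5, 5),
}
ranking = rank_sites(counts)
print(ranking)  # ['site-b.example', 'site-a.example', 'site-c.example']
```

A production ranking would likely weight items by their temporal sensitivity instead of counting them equally; the equal weighting here only keeps the sketch short.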

Description

Technical field

[0001] The present invention belongs to the field of data quality research and involves technologies such as the temporal Web, network information quality assessment, semantic understanding and extraction of temporal information, establishment of a temporal object model and its constraint relationships, and a reasoning mechanism and algebraic operation system for temporal information. In particular, it proposes a method for automatically discovering outdated webpage information based on a Web temporal object model.

Background technique

[0002] Temporal Web: This project belongs to the category of temporal Web research. In recent years, the temporal Web has increasingly become a focus of scholars. In 2011, WWW, the most authoritative international academic conference on the Web, set up a dedicated "Temporal Web" workshop, TWAW. Omar Alonso et al. analyzed the types of time information in documents, the expression and formalization of time, and the labeling of time, and pointed ...


Application Information

IPC(8): G06F17/30
Inventor: 李石君, 杨莎, 甘琳, 余伟, 王俊, 刘晶, 丁永刚, 王峰
Owner: WUHAN UNIV