Method for automatically discovering and sequencing outdated webpage based on Web time inconsistency

A technology for automatic discovery and sorting methods, applied in the fields of instruments, computing, and electrical digital data processing, etc., can solve problems such as modeling, reasoning and measurement of Web time inconsistency, and no systematic and in-depth research

Inactive Publication Date: 2011-11-23
WUHAN UNIV
View PDF5 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In short, the existing research has conducted in-depth research on the temporal Web, web page quality assessment and timeliness measurement, but has not done so in

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for automatically discovering and sequencing outdated webpage based on Web time inconsistency
  • Method for automatically discovering and sequencing outdated webpage based on Web time inconsistency
  • Method for automatically discovering and sequencing outdated webpage based on Web time inconsistency

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0148] figure 1 Shown is the overall scheme of the present invention, and the scheme of the present invention is divided into two levels:

[0149] The first layer is the "theoretical framework". By analyzing the time sensitivity of the webpage, a multi-dimensional time model of the webpage is established, and the multi-dimensional time data extraction of the webpage is carried out according to the conceptual model of time knowledge, regular grammar and time extraction based on pattern algebra. Establish a web time inconsistency model, including a web page time inconsistency model, a web page and column time inconsistency model, and a different website time inconsistency model in the same column; conduct web time inconsistency reasoning and measurement, which includes two parts: the first part is web page time inconsistency reasoning and measurement, Inference based on web page time inconsistency is composed of two parts, which are divided into inferring unknown time dimension ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for automatically discovering and sequencing an outdated webpage based on Web time inconsistency. On the basis of time living of a webpage, The method comprises the following steps: establishing a multi-dimensional time vector of a webpage; extracting a multi-dimensional time value of the webpage; establishing a Web time inconsistency model; providing a novel method for reasoning and measuring the Web time inconsistency; constructing a principle framework for solving the Web time inconsistency; and applying the principle framework to (1) the automatic discovering and sequencing of the webpage when the website is outdated so as to release website maintainers from heavy manpower inspection and contribute to saving human resources, (2) quality arrangement of similar websites on the basis of time inconsistency measurement, such as sequencing of a government or university website in quality of information freshness, and (3) time sensitive information arrangement method for a search engine to ensure that a user can search latest information more convenient and improve the content quality and user evaluation of the website.

Description

technical field [0001] The present invention relates to technical fields such as temporal Web, webpage quality assessment and timeliness measurement, webpage information retrieval system based on time perception, web information extraction, temporal database, etc., especially relates to an outdated webpage automatic discovery and analysis based on web time inconsistency Sorting methods and systems. Background technique [0002] Temporal Web: In recent years, temporal Web has increasingly become the focus of scholars. In 2011, WWW (International World Wide Web Conference), the most authoritative international conference on Web academics, specially set up a "Temporal Web" seminar. Based on the temporal Web graph composed of various network snapshots at different time points, Na Dai et al. established a network surfing model to combine the refresh rates of each network [1] . Marius Pasca's research found that when retrieving time-stamped web documents, the user's search term...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 李石君甘琳杨莎刘世超刘咏宁李宇轩
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products