Self adaptive net paper updating time predicting method

A technology of update time and prediction method, which is applied in special data processing applications, instruments, electrical digital data processing, etc. It can solve the problem that the next update time has a large difference, the proximity method is difficult to adapt in time, and the proximity method has a slow convergence speed, etc. problem, to achieve the effect of guaranteeing freshness, reducing system overhead, and good performance

Inactive Publication Date: 2007-04-11
上海态格信息技术有限公司
View PDF0 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The advantage of this method is that it is relatively simple, but the disadvantage is that if the set initial update time is quite different from the actual next update time of the web page, the convergence speed of the proximity method will be slower. In addition, if the update frequency of the web page changes abruptly, the adjacent It is also difficult for the law to adapt to this sudden change in time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self adaptive net paper updating time predicting method
  • Self adaptive net paper updating time predicting method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0022] Take a web page of the yahoo community:

[0023] Take http: / / cn.bbs.yahoo.com / message / read_talkcar_174080.html as an example, this is a BBS page, take its first 60 update time series (this series can be read directly from the web page), and take the first The value is a reference, and the sequence is converted into seconds, then the sequence is:

[0024] 0 935 231883 261484 277037 314594 346493 346601

[0025] 355709 401795 402343 408114 445925 493502 530610

[0026] 580559 596884 620318 668050 680267 680267 680270

[0027] 680282 686234 686533 686609 691639 695092 699361

[0028] 699813 751811 786379 786384 790780 826472 847222

[0029] 856377 873258 873687 876733 927321 1014280 1018088

[0030] 1019502 1027354 1047183 1049073 1086272 1086275 1092288

[0031] 1103902 1128980 1135175 1135295 1137836 1195896 1214459

[0032] 1223416 1261189 1304231

[0033] The minimum step size of the web page update time prediction component is set to mi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention is a kind of prediction method adaptive for website update time, which is the improved adjacency method. It can predict the next updating time of website according to its history regulation, quickly predict the order of updating frequency in the absence of prior knowledge for the updating frequency of website, and adapt to the sudden change of updating frequency of website rapidly. Through MATLAB simulations, the method can accurately predict the website updating time. Compared with the classic adjacency method, this method can ensure the trendy property of the captured website under the condition of significant reducing system expense. The method adapts to the website grasp system, and its performance is excellent in a real application.

Description

Technical field: [0001] The invention relates to the field of Internet information processing, in particular to a method for predicting the update time of a webpage. Background technique: [0002] The exponential growth of web page information on the Internet has brought enormous pressure on the information collection of network application systems such as search engines. On the one hand, in order to keep the information fresh, it must be captured as frequently as possible Web pages, in order to obtain updated web pages in a timely manner; on the other hand, due to the limitation of hardware resources, it is necessary to crawl web pages with as low a frequency as possible to reduce invalid crawling (that is, crawling to non-updated web pages). Web page update time prediction is the key to solving the above-mentioned contradictions. Its purpose is to accurately predict the update time of web pages, so that web crawlers can acquire up-to-date web pages with minimal overhead. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 邱致中王少刚
Owner 上海态格信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products