Method and device for detecting website content

A website content detection method, applied in the field of information detection, that addresses the low accuracy of judging whether website content has been updated.

Active Publication Date: 2020-03-03
BEIJING GRIDSUM TECH CO LTD
6 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0005] The main purpose of this application is to provide a method and device for detecting website content, so as to solve the problem in related technologies of low accuracy in judging whether the content of a website has been updated.

Examples

Embodiment Construction

[0020] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0021] To enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiments of the application is described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some, not all, of the embodiments of the application. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, shall fall within the protection scope of this application.

[0022] It should be noted that the terms "first" and "second" i...

Abstract

The invention discloses a method and apparatus for detecting website content. The method comprises: obtaining any list page in a crawled page set; obtaining first crawled content and second crawled content, where the first crawled content is the content of the list page crawled at a first moment in a target time period, and the second crawled content is the content of the list page crawled at a second moment in the target time period; obtaining a first character string and a second character string, where the first character string is formed by splicing the link addresses of the links in the first crawled content in a preset order, and the second character string is formed by splicing the link addresses of the links in the second crawled content in the same preset order; and detecting, according to the first and second character strings, whether the content of a target website was updated in the target time period. The method and apparatus address the problem in related technologies of relatively low accuracy in judging whether website content has been updated.
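The comparison described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `LinkCollector`, `link_string`, and `content_updated` are hypothetical, the "preset order" is assumed to be lexicographic, and "detecting according to the two strings" is assumed to be a plain inequality test. The crawled content is assumed to be HTML text already fetched at the two moments.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href values of <a> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def link_string(html, separator="\n"):
    """Splice the link addresses of one crawl into a single string.

    Assumption: the preset order is lexicographic. The abstract only
    requires that both crawls use the same order.
    """
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return separator.join(sorted(collector.links))


def content_updated(first_crawl, second_crawl):
    """Return True if the two crawls yield different link strings.

    Assumption: a simple inequality test stands in for the detection step.
    """
    return link_string(first_crawl) != link_string(second_crawl)


# Two crawls of the same list page within the target time period.
first = '<ul><li><a href="/news/1.html">A</a></li><li><a href="/news/2.html">B</a></li></ul>'
second = ('<ul><li><a href="/news/3.html">C</a></li>'
          '<li><a href="/news/1.html">A</a></li><li><a href="/news/2.html">B</a></li></ul>')
reordered = '<ul><li><a href="/news/2.html">B</a></li><li><a href="/news/1.html">A</a></li></ul>'

print(content_updated(first, second))     # a new link appeared
print(content_updated(first, reordered))  # same links, different page order
```

Because the link addresses are put into a fixed order before splicing, a page that only rearranges existing links produces the same string, while a page that adds or removes a link produces a different one.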

Description

Technical field

[0001] The present application relates to the field of information detection, and in particular to a method and device for detecting website content.

Background

[0002] A website content update refers to the release of new pages on the website. When most websites release a new page, they display a link to it on the homepage, and many also display a summary of the new page. A homepage content update means that the content of the homepage has changed; in most cases, the link list of one or more columns has been updated. Whether a website's content is updated is an indicator that mainly reflects the activity of the website and the validity of its information. For government websites in particular, it is also an important indicator by which higher-level governments assess lower-level government portals. Relevant government documents clearly stipulate that all ...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/958, G06F16/951, G06F16/953, G06F16/9535
CPC: G06F16/951, G06F16/958
Inventor: 冯鸳鹤, 孙德彬
Owner: BEIJING GRIDSUM TECH CO LTD