Webpage sampling method oriented at barrier-free webpage content detection

A web content accessibility technology applied to webpage sampling for web content accessibility detection. It addresses the omissions, incomplete coverage, and errors that existing sampling approaches introduce into accessibility detection results, and the algorithm is effective with low time complexity.

Active Publication Date: 2014-05-28
ZHEJIANG UNIV

AI Technical Summary

Problems solved by technology

In practice, however, this assumption does not hold. For detection items such as "error reason prompt (level 2)" and "input prompt (level 2)", the detection point is usually located on webpages that contain components such as a search box, a registration form, or a login form, and not every webpage has detection points for such items. If the website is sampled at random, the sampled webpages may fail to cover these detection points, which introduces errors into the accessibility detection results. The key-webpage sampling algorithm selects only some key webpages as the sample, so the proportion of erroneous webpages in the sample may differ greatly from the proportion in the overall population, which leads to large errors in the detection results. In addition, the key webpages of a website cannot all be obtained automatically, and searching for them manually is costly and may miss some.
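The coverage concern with plain random sampling can be made concrete with a small calculation. The sketch below is illustrative only: it assumes pages are drawn uniformly without replacement, and the function name `miss_probability` and the page counts in the usage note are hypothetical, not taken from the patent.

```python
from math import comb


def miss_probability(total_pages: int, pages_with_point: int, sample_size: int) -> float:
    """Probability that a uniform random sample (without replacement)
    contains none of the pages that carry a given detection point."""
    if not 0 <= pages_with_point <= total_pages:
        raise ValueError("pages_with_point must be between 0 and total_pages")
    if not 0 <= sample_size <= total_pages:
        raise ValueError("sample_size must be between 0 and total_pages")
    # Count the samples drawn entirely from pages without the detection
    # point, divided by the count of all possible samples.
    pages_without = total_pages - pages_with_point
    return comb(pages_without, sample_size) / comb(total_pages, sample_size)
```

For a hypothetical site of 1000 pages where only 5 contain a login or search form, `miss_probability(1000, 5, 50)` is roughly 0.77, so a 5% random sample would more often than not contain no page with that detection point.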


Examples

Embodiment Construction

[0019] The specific implementation of the present invention is described in detail below with reference to the accompanying drawings and the illustrated process.

[0020] The present invention provides a webpage sampling method oriented to webpage content accessibility detection, comprising the following steps:

[0021] (1) Crawl the URLs of all related webpages of the website, starting from the URL to be detected;

[0022] (2) Cluster all URLs based on minimum description length (MDL);

[0023] (3) Randomly sample each cluster according to the sampling ratio λ to form the sample;

[0024] (4) Perform conventional accessibility compliance testing on the sample set.
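Step (3) can be sketched as per-cluster random sampling. This is a minimal illustration, not the patent's implementation: it assumes the clusters from step (2) are already available as a mapping from a cluster label to a list of URLs, and the function name `sample_clusters`, the rounding rule, and the choice to take at least one URL from every non-empty cluster are assumptions made here.

```python
import math
import random
from typing import Dict, List, Optional


def sample_clusters(clusters: Dict[str, List[str]], ratio: float,
                    seed: Optional[int] = None) -> List[str]:
    """Draw a random sample from each URL cluster at the given ratio."""
    if not 0 < ratio <= 1:
        raise ValueError("ratio must be in (0, 1]")
    rng = random.Random(seed)
    sample: List[str] = []
    for urls in clusters.values():
        if not urls:
            continue
        # Assumption: round up and keep at least one URL per cluster,
        # so that small clusters are still represented in the sample.
        k = max(1, math.ceil(ratio * len(urls)))
        sample.extend(rng.sample(urls, k))
    return sample
```

Because every cluster contributes to the sample, page types that are rare on the site, such as a login page, are still covered as long as the clustering places them in their own cluster.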

[0025] The specific implementation process of URL clustering in step (2) is:

[0026] Step (2.1) For a URL set S, let T(s) denote the set of all URL items in a URL s ∈ S, and let S(t) denote the set of URLs containing a given URL item t...
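The two mappings T(s) and S(t) can be sketched as a forward and an inverted index. The excerpt does not say how a URL is split into items, so the tokenization below (host, path segments tagged by depth, and query key/value pairs) is an assumption made purely for illustration, as are the function names `url_items` and `build_index`.

```python
from collections import defaultdict
from typing import Dict, FrozenSet, Iterable, Set, Tuple
from urllib.parse import parse_qsl, urlsplit

Item = Tuple[str, str]


def url_items(url: str) -> FrozenSet[Item]:
    """Split a URL into (position, value) items (hypothetical tokenization)."""
    parts = urlsplit(url)
    items: Set[Item] = set()
    if parts.netloc:
        items.add(("host", parts.netloc))
    segments = [seg for seg in parts.path.split("/") if seg]
    for depth, seg in enumerate(segments):
        items.add((f"path{depth}", seg))
    for key, value in parse_qsl(parts.query, keep_blank_values=True):
        items.add((f"query:{key}", value))
    return frozenset(items)


def build_index(urls: Iterable[str]):
    """Return T (URL -> its items) and S (item -> URLs containing it)."""
    T: Dict[str, FrozenSet[Item]] = {u: url_items(u) for u in urls}
    S: Dict[Item, Set[str]] = defaultdict(set)
    for u, items in T.items():
        for t in items:
            S[t].add(u)
    return T, dict(S)
```

With this representation, T(s) is `T[s]` and S(t) is `S[t]`; items shared by many URLs, such as the host, map to large URL sets, while items specific to one page map to small ones.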

Abstract

The invention discloses a webpage sampling method oriented to barrier-free webpage content detection. The method comprises: crawling the URLs (uniform resource locators) of all related webpages of a website starting from the URL to be detected; building an MDL (minimum description length) model for the URL set of the website; clustering all the URLs with an MDL-based greedy algorithm; randomly sampling each cluster according to a sampling ratio λ to obtain the sample; and performing conventional barrier-free compliance detection on the sample set.

Description

Technical field

[0001] The invention relates to the field of webpage barrier-free detection and transformation methods, and in particular to a webpage sampling method for barrier-free webpage content detection.

Background technique

[0002] With the popularization and development of the Internet, increasing attention has been paid to Internet access for people with disabilities. Their special needs in browsing and using the web impose barrier-free requirements on website design, so existing websites need to be tested for accessibility. To assess the accessibility of a website in a short period of time, it is impractical to test every webpage of the website, so a webpage sampling algorithm is required. The purpose of the invention is to reduce the proportion of manual detection, speed up the detection speed of the website and reduce the detection cost during the website accessibility detection proce...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F11/36
Inventor: 卜佳俊, 李凯, 周宇, 王灿, 王炜
Owner: ZHEJIANG UNIV