Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for identifying consistency of web page information

A web page information and identification method technology, applied in the field of web page information consistency identification, can solve the problems of low accuracy of identification results, low accuracy of correlation calculation and processing, etc.

Active Publication Date: 2017-05-24
ALIBABA GRP HLDG LTD
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the low accuracy of correlation calculation processing, the accuracy of recognition results is not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for identifying consistency of web page information
  • Method and device for identifying consistency of web page information
  • Method and device for identifying consistency of web page information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0050] see figure 1 , which is a method flow chart of a web page information consistency identification method disclosed in Embodiment 1 of the present application, the method includes the following steps:

[0051] Step 101: Acquiring the first category of webpage information from the database, the description object of the first category of webpage information is the same as that of the identified webpage information and located in the same category;

[0052] As a new type of information carrier, webpage information of a website is used to carry information of a specific object so that website users can browse, and the specific object is the description object of the website information. The description objects of webpage information on different websites are also different. For example, for shopping websites such as Taobao, Jingdong, Amazon, and Dangdang, the description objects can be products (ie, clothing, food, furniture, home appliances, books, etc.); for Youku, For vi...

Embodiment 4

[0119] Corresponding to the method for identifying the consistency of web page information in the second embodiment above, the fourth embodiment of the present application also provides an apparatus for identifying the consistency of web page information. see Figure 4 , which is a device schematic diagram of a web page information consistency identification device disclosed in Embodiment 4 of the present application, the device includes: an acquisition module 401, an extraction module 402, an analysis module 403, a calculation module 404, a selection module 405, and a statistics module 406 , an elimination module 407 , a judgment module 408 and a determination module 409 . The following will further introduce its internal structure and connection relationship in combination with the working principle of the device.

[0120] An acquisition module 401, configured to acquire a first type of web page information from a database, the first type of web page information is the same...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An embodiment of the invention discloses a method and a device for identifying consistency of webpage information. The identifying method includes acquiring a first kind of webpage information from a database; extracting header information and attribute information from the webpage information, subjecting the header information and the attribute information to word segmentation analysis to obtain attributes of a describing object; counting an attribute value of each attribute and co-occurrence information under a category, to which the describing object belongs; removing attribute values in the co-occurrence information from the attribute values to obtain a contradiction attribute value contained in each attribute; judging whether the attribute values of the header information and the attribute information in the identified webpage information are attribute values under the same attribute of the describing object; if the attribute values of the header information and the attribute information in the identified webpage information are attribute values under the same attribute of the describing object, determining the identified webpage information to be inconsistent, and otherwise, determining the identified webpage information to be consistent. According to the method and the device for identifying consistency of the webpage information, consistency of the webpage information can be identified, and identification efficiency is improved.

Description

technical field [0001] The invention relates to the field of computer application technology, in particular to a method and device for identifying consistency of webpage information. Background technique [0002] On the third-party shopping platform, the seller user publishes the product webpage through the platform, and the buyer user uses the search engine on the platform to find the webpage that meets the specific search conditions in the webpage published by the seller, and the search engine lists these webpages that meet the specific search condition Displayed to buyers and users in the form of search results, the search results are a collection of links to product webpages that meet specific search conditions. Links on product webpages generally contain product title information and product pictures. When buyers browse various When you are interested in a link because of the product title information and product pictures on the link, you will further decide to click on...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/36G06F16/95
Inventor 韦袆冯景华陈明修
Owner ALIBABA GRP HLDG LTD