Method and device for web page collection and recommendation

A recommendation method and webpage technology, which is applied in network data indexing, network data retrieval, unstructured text data retrieval, etc., can solve problems such as low efficiency, low accuracy of webpage collection and recommendation, and easy omission of webpages, so as to improve accuracy and efficiency, enhance experience, and reduce the chance of omission

Active Publication Date: 2019-07-19
INST OF SCI & TECHN INFORMATION OF CHINA
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] Aiming at the shortcomings of the existing web page collection methods, the present invention proposes a method and device for web page collection and recommendation to solve the problems in the prior art that the accuracy of web page collection and recommendation is low, or the efficiency is low, or it is easy to omit web pages, so as to improve Accuracy or efficiency of page recommendations, or reducing chances of missing pages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for web page collection and recommendation
  • Method and device for web page collection and recommendation
  • Method and device for web page collection and recommendation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0035] Those skilled in the art will understand that unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the description of the present invention refers to the presence of said features, integers, steps, operations, elements and / or components, but does not exclude the presence or addition of one or more other features, Integers, steps, operations, elements, components, and / or groups thereof. It will be unders...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a webpage collection recommendation method and device. The method comprises the following steps: acquiring a plurality of seed webpages relevant to a keyword; determining semantic concepts of the seed webpages; clustering the semantic concept of each seed webpage, and obtaining a semantic structure; acquiring a plurality of webpages linked to each seed webpage as a plurality of extended webpages; determining a similarity between each extended webpage and the semantic structure; according to an extended webpage whose similarity is not lower than a first similarity threshold, updating the seed webpages; evoluting the semantic structure according to the updated seed webpages periodically and updating the extended webpages, and updating the seed webpages according to the evoluted semantic structure until reaching a preset webpage acquisition termination condition; and sorting and recommending various seed webpages which are updated in the last cycle. By using the embodiment of the invention, the webpages can be collected and recommended for users more accurately and comprehensively, which facilitates the users to search the needed webpages, and improve the user experience.

Description

technical field [0001] The present invention relates to the technical field of network data collection, in particular, the present invention relates to a method and device for collecting and recommending web pages. Background technique [0002] With the rapid development of the Internet, the amount of information on the network is increasing day by day. Users usually use search engines to find desired web page information on the Internet. [0003] The search engine (Search Engine) uses web crawlers to grab web pages from the Internet in advance according to the specified strategy, and after receiving the keywords entered by the user, it calculates the correlation between the keywords and the pre-caught web pages, and selects the correlation degree. Higher pages are recommended and displayed to users. [0004] Currently, web crawlers mainly include traditional crawlers and theme crawlers. [0005] A kind of webpage collection recommendation method utilizing traditional cra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/951G06F16/955
CPCG06F16/35G06F16/955
Inventor 刘耀龚幸伟
Owner INST OF SCI & TECHN INFORMATION OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products