
Retrieval method and device for theme webpage

A retrieval technology for theme webpages, applied in the field of retrieval methods and devices for theme webpages, that addresses problems such as low retrieval accuracy.

Active Publication Date: 2018-02-02
SHANDONG LANGCHAO YUNTOU INFORMATION TECH CO LTD
7 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

The existing keyword-based method has low retrieval accuracy when retrieving web pages on a specific topic.




Embodiment Construction

[0079] To make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on these embodiments, without creative work, fall within the scope of protection of the present invention.

[0080] As shown in Figure 1, an embodiment of the present invention provides a method for retrieving a theme webpage. The method may include the following steps:

[0081] Step 101: Determine the retrieval theme, and construct a theme lexicon corresponding to the retrieval theme;

[0082] Step 102: Obtain at least one retrieval webpage, and determin...



Abstract

The invention provides a retrieval method and device for a theme webpage. The method comprises the following steps: determining a retrieval theme, and constructing a theme lexicon corresponding to the retrieval theme; obtaining at least one retrieval webpage, and determining the page content and the URL (Uniform Resource Locator) address corresponding to each retrieval webpage; dividing the page content into at least two paragraphs according to the number of words in the page content of the retrieval webpage; determining, according to the theme lexicon, at least one effective paragraph from the at least two paragraphs, and determining the paragraph proportion of the effective paragraphs; determining whether the paragraph proportion is greater than a preset paragraph proportion threshold and, if so, determining at least one theme address word group from the URL address and determining the word group proportion corresponding to the theme address word group; determining a theme total score for the retrieval webpage according to the determined word group proportion and paragraph proportion; and determining whether the theme total score is greater than a preset total score threshold and, if so, determining the retrieval webpage to be a theme webpage. This scheme can improve the accuracy of retrieving theme webpages.
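The steps in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the abstract does not say how paragraphs are cut, how an effective paragraph is recognised, how the URL is split into word groups, or how the two proportions are combined. The fixed-size chunking, the "contains at least one lexicon word" test, the URL tokenisation, the weighted sum, and all function names, weights and thresholds below are hypothetical.

```python
import re
from urllib.parse import urlparse


def split_into_paragraphs(words, words_per_paragraph):
    """Cut a word list into fixed-size chunks (assumed splitting rule)."""
    return [words[i:i + words_per_paragraph]
            for i in range(0, len(words), words_per_paragraph)]


def paragraph_proportion(paragraphs, lexicon):
    """Share of paragraphs holding at least one lexicon word (assumed test)."""
    if not paragraphs:
        return 0.0
    effective = [p for p in paragraphs if any(w in lexicon for w in p)]
    return len(effective) / len(paragraphs)


def url_phrase_proportion(url, lexicon):
    """Share of URL tokens found in the lexicon (assumed tokenisation)."""
    parsed = urlparse(url)
    raw = (parsed.netloc + parsed.path).lower()
    tokens = [t for t in re.split(r"[^a-z0-9]+", raw) if t]
    if not tokens:
        return 0.0
    return sum(1 for t in tokens if t in lexicon) / len(tokens)


def is_theme_page(content, url, lexicon,
                  words_per_paragraph=50,    # hypothetical chunk size
                  paragraph_threshold=0.5,   # hypothetical preset threshold
                  total_threshold=0.4,       # hypothetical preset threshold
                  paragraph_weight=0.7,      # hypothetical weighting
                  url_weight=0.3):
    """Apply the paragraph check first, then the combined score check."""
    words = re.findall(r"[a-z0-9]+", content.lower())
    # Note: very short pages may yield fewer than two chunks; the sketch
    # does not treat that case specially.
    paragraphs = split_into_paragraphs(words, words_per_paragraph)
    p_prop = paragraph_proportion(paragraphs, lexicon)
    if p_prop <= paragraph_threshold:
        return False  # fails the paragraph proportion check
    u_prop = url_phrase_proportion(url, lexicon)
    total = paragraph_weight * p_prop + url_weight * u_prop  # assumed formula
    return total > total_threshold
```

The two-stage design mirrors the abstract: the URL is examined only for pages whose content already passes the paragraph proportion threshold, so pages that mention a theme word once in passing are rejected before any score is computed.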

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a retrieval method and device for a theme webpage.

Background

[0002] With the development of Internet technology, all kinds of data are presented through web pages of different forms. Quickly and accurately retrieving web pages on a specific topic from among many web pages plays an important role in improving the efficiency of processing data related to that topic.

[0003] At present, web pages on a specific topic are retrieved mainly on the basis of the page content. Specifically, the method first receives keywords for a specific topic entered by the retrieval user, then determines the web pages whose page content contains the keywords, and finally outputs these web pages.

[0004] In this process, it is only determined whether the web page meets the retrieval requirements by determining whether the c...
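For contrast, the keyword-only retrieval described in the background can be sketched in a few lines. The function name and the sample pages are hypothetical; the sketch only illustrates why a single passing mention is enough for a page to be returned.

```python
def keyword_match(pages, keywords):
    """Return the URLs of pages whose content contains any keyword.

    `pages` maps a URL to its page content; matching is a plain,
    case-insensitive substring test (an assumed simplification).
    """
    lowered = [k.lower() for k in keywords]
    return [url for url, content in pages.items()
            if any(k in content.lower() for k in lowered)]


# Hypothetical sample data: page "a" is about cooking and mentions
# the keyword only once, yet it is returned alongside page "b".
pages = {
    "a": "A long article about cooking. One line mentions a patent.",
    "b": "Patent search and patent retrieval guide.",
}
matches = keyword_match(pages, ["patent"])
```

Both pages are matched here, even though only the second is actually about the topic.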


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/951, G06F16/955
Inventors: 徐同年, 刘瑞慧, 于晓晨, 朱斌, 王岗
Owner: SHANDONG LANGCHAO YUNTOU INFORMATION TECH CO LTD