Method and device for searching web pages according to tendency values

A tendency-oriented web search technology applied in the field of information retrieval. It addresses problems such as slow speed and a limited number of analyzed webpages, and achieves faster response, low time complexity, and improved search satisfaction.

Inactive Publication Date: 2011-06-29
SHANGHAI LAISEEK INFORMATION TECH +2

AI Technical Summary

Problems solved by technology

[0006] In addition, if an existing search engine wants to obtain the tendency value of a named entity in a webpage, it can only analyze the tendency of the named entities in the search results after the search is completed, that is, through online processing. The downside of such lagging online processing is that it is slow and the number of pages that can be analyzed is limited.



Examples


Embodiment Construction

[0052] The concept, specific structure, and technical effects of the present invention are further described below with reference to the accompanying drawings, so that its purpose, features, and effects can be fully understood.

[0053] As shown in Figure 1, the present invention discloses a method for searching webpages according to tendency values, comprising the following steps:

[0054] Step 101, obtaining several webpages and downloading them to the webpage database;

[0055] The search engine company uses a webpage fetcher to obtain several webpages from the Internet and downloads them to its own computers, that is, to the webpage database.
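Step 101 can be illustrated with a minimal sketch. It assumes an in-memory SQLite table as the webpage database and a stand-in fetcher in place of a real crawler; the table layout, the function names, and the example URLs are hypothetical and do not come from the patent.

```python
import sqlite3
from typing import Callable, Iterable


def build_page_database(urls: Iterable[str],
                        fetch: Callable[[str], str]) -> sqlite3.Connection:
    """Download each URL with `fetch` and store its text in a local table."""
    db = sqlite3.connect(":memory:")
    # Hypothetical schema: one row per downloaded webpage.
    db.execute(
        "CREATE TABLE pages ("
        "page_id INTEGER PRIMARY KEY, url TEXT UNIQUE, body TEXT)"
    )
    for url in urls:
        body = fetch(url)
        # Skip URLs that were already stored.
        db.execute(
            "INSERT OR IGNORE INTO pages (url, body) VALUES (?, ?)",
            (url, body),
        )
    db.commit()
    return db


# Stand-in for the Internet, so the sketch runs without network access.
FAKE_WEB = {
    "http://example.com/a": "The Acme phone is great.",
    "http://example.com/b": "The Acme phone is terrible.",
}


def fake_fetch(url: str) -> str:
    return FAKE_WEB[url]


db = build_page_database(FAKE_WEB, fake_fetch)
page_count = db.execute("SELECT COUNT(*) FROM pages").fetchone()[0]
```

Passing the fetcher in as a parameter keeps the storage step separate from the download step, so a real HTTP client could be substituted without changing the database code.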

[0056] Step 102, performing named entity recognition on the text of the several webpages;

[0057] First, the named entity recognizer scans each webpage, segments the text of each webpage into words, and performs part-of-speech tagging;

[0058] Second, the named entity recognizer judges w...
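The segmentation and part-of-speech tagging described in step 102 can be sketched as follows. This is only an illustration under stated assumptions: the lexicon, the tag names, and the regular-expression segmenter are hypothetical, and a real system for Chinese text would need a proper word segmenter. The recognizer's actual judgment rule is cut off in the text above, so the last function uses a stand-in rule (treat proper nouns as candidate named entities) that is an assumption, not the patent's method.

```python
import re

# Hypothetical toy lexicon: lowercase word -> part-of-speech tag.
# "NNP" marks a proper noun in this sketch.
TOY_LEXICON = {
    "the": "DT",
    "acme": "NNP",
    "phone": "NN",
    "is": "VB",
    "great": "JJ",
}


def segment(text):
    """Split text into word tokens (whitespace-delimited languages only)."""
    return re.findall(r"[A-Za-z0-9]+", text)


def tag(tokens):
    """Attach a part-of-speech tag to each token; unknown words get 'UNK'."""
    return [(token, TOY_LEXICON.get(token.lower(), "UNK")) for token in tokens]


def candidate_entities(tagged):
    """Stand-in rule (an assumption): proper nouns are candidate entities."""
    return [word for word, pos in tagged if pos == "NNP"]


tagged = tag(segment("The Acme phone is great."))
entities = candidate_entities(tagged)
```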


Abstract

The invention discloses a method and a device for searching webpages according to tendency values. The method comprises the following steps: A) acquiring a plurality of webpages and downloading them to a webpage database; B) performing named entity recognition on the text of the webpages; C) performing tendency analysis on the named entities in the webpages to obtain their tendency values; D) building a forward index table that contains the tendency values of the named entities; E) building an inverted index table that contains the tendency values of the named entities; F) receiving a search item as input and decomposing it into at least one keyword; and G) calculating the ranking weights of the webpages that contain the keywords according to the inverted index table, and outputting a search result. Because the webpages containing the search keywords are ranked mainly by tendency value, webpages with a derogatory or commendatory tendency are ranked first, which improves the user's search satisfaction.
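Steps D through G can be illustrated with a small sketch. The abstract does not give the weight formula, so this example assumes a page's ranking weight is the sum of the absolute tendency values of the matched keywords; that formula, the page identifiers, the entity names, and the numeric values are all hypothetical.

```python
from collections import defaultdict

# Forward index (step D): page id -> {named entity: tendency value}.
# Positive values stand for commendatory tendency, negative for derogatory.
forward_index = {
    "page1": {"acme": 0.9, "globex": -0.2},
    "page2": {"acme": -0.7},
    "page3": {"acme": 0.1, "globex": 0.8},
}


def build_inverted_index(forward):
    """Step E: named entity -> list of (page id, tendency value)."""
    inverted = defaultdict(list)
    for page_id, names in forward.items():
        for name, tendency in names.items():
            inverted[name].append((page_id, tendency))
    return dict(inverted)


def search(query, inverted):
    """Steps F and G: split the query into keywords, then rank pages."""
    keywords = query.lower().split()
    weights = defaultdict(float)
    for keyword in keywords:
        for page_id, tendency in inverted.get(keyword, []):
            # Assumed formula: strongly opinionated pages rank first,
            # whether the opinion is positive or negative.
            weights[page_id] += abs(tendency)
    # Highest weight first; ties broken by page id for a stable order.
    return sorted(weights, key=lambda page_id: (-weights[page_id], page_id))


inverted_index = build_inverted_index(forward_index)
results = search("Acme", inverted_index)
```

Because the tendency values are stored in the inverted index ahead of time, the query path only reads and sums precomputed numbers, which is the offline alternative to analyzing tendency after each search.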

Description

Technical field

[0001] The invention relates to the fields of information retrieval and natural language processing, in particular to a method and device for searching webpages according to tendency values.

Background

[0002] When ranking search results, existing mainstream search engines (such as Google, Yahoo, and Baidu) do not consider the tendency value of the webpage or the tendency value of the keywords decomposed from the search item.

[0003] At the Seventh World Wide Web Conference in 1998, Sergey Brin and Lawrence Page published a paper titled "The Anatomy of a Large-Scale Hypertextual Web Search Engine", which disclosed the index structure of the Google search engine. Neither the forward index table nor the inverted index table of the Google search engine contains any tendency value information.

[0004] The patent number is ZL01109132.0, and the invention patent titled "Method for Judging the Positional Relevance of a Group of Query Keywords o...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 杜一华
Owner: SHANGHAI LAISEEK INFORMATION TECH