Diversified expansion method of keyword

A technology of keywords and extension methods, applied in the field of Web information retrieval, to achieve the effect of full coverage

Active Publication Date: 2014-04-23
TONGJI UNIV
View PDF6 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At the same time, this type of recommendation requires the system to extract high-frequency keywords in real time, which brings a certain load and delay to the system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Diversified expansion method of keyword
  • Diversified expansion method of keyword
  • Diversified expansion method of keyword

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The technical solutions of the present invention will be further described in detail below with reference to the accompanying drawings.

[0025] The first step is to construct an index network based on web page classification according to the hyperlinks on the Internet. The construction steps of the index network are as follows: figure 1 shown.

[0026] (1) First, select the webpage classification system and its training set, and use the naive Bayesian algorithm to complete the training of the feature vectors of the webpage class. Specifically, we use the Chinese part of the dmoz manual classification directory (http: / / www.dmozdir.org / ), manually select 300 categories in the classification directory, and use the web pages they contain as the training set. After the training is completed, we use WorldNet to expand the synonyms of the feature words of the web page in order to obtain a more comprehensive feature vector. (2) Then, crawl the webpages on the Internet, and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method assisted for searching network information. Through the method, diversified expansion of a keyword can be realized; the method can be embedded into a plurality of web information service systems; the method is based on a simple webpage preprocessing and organizing mechanism; the method is capable of obtaining a diversified expansion word set in different ranges of the keywords, building an index network based on webpage classification according to a hyperlink of an internet and realizing diversified expansion of the keyword based on the built index network; and even if a user did not inquire the keyword or the field before, the expansion still can recommend most possible query semantics of the user.

Description

technical field [0001] The invention belongs to the field of Web information retrieval, and in particular relates to a keyword expansion method in Web information retrieval and Web information application. Background technique [0002] With the popularization of the Internet in people's daily life, the resources on the Internet are increasing exponentially. All kinds of information are scattered on the Internet. At present, most users use search engines to find information. However, relying on keyword matching technology to filter information makes the existing search engine technology have great limitations. One of shortcoming is: the quality of user service quality of search engine depends on the precision degree of the keyword of user input to a large extent. In fact, only a small number of users give accurate search terms at one time. Due to the differences in the user's prior knowledge and the user's expressive ability, in many cases, the user needs the search engine...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/951
Inventor 蒋昌俊陈闳中闫春钢丁志军王鹏伟孙海春
Owner TONGJI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products