Supercharge Your Innovation With Domain-Expert AI Agents!

Method for improving searching engine based on keyword index using phrase index technique

A search engine and index technology, applied in the field of back-end processing, can solve problems such as flooding of useless results, ambiguous query matching, and inability to reflect keyword correlation query results well, achieving accurate possible intentions and high ranking scientific effect

Inactive Publication Date: 2008-06-18
新百丽鞋业(深圳)有限公司 +1
View PDF0 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] 2. The relevance of each keyword cannot be well reflected in the query results
[0017] To sum up, the existing search engines are fuzzy in matching complete queries, which is conducive to getting more results, but it leads to a lot of useless results, and even interferes with the position of better results, and these Search engines do not do special processing for questions, and the effect is relatively poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for improving searching engine based on keyword index using phrase index technique
  • Method for improving searching engine based on keyword index using phrase index technique
  • Method for improving searching engine based on keyword index using phrase index technique

Examples

Experimental program
Comparison scheme
Effect test

example 1

[0099] Example 1: A common phrase composed of a single keyword and its simple logical combination

[0100] Because a single keyword and its simple logical combination can often find the corresponding text in the original text of the webpage text, the advantages of the traditional search engine after the improvement of the present invention are not very prominent, and the results before and after the improvement are basically the same, and no examples are given here.

[0101] Example 2. Multi-keywords represent the average result of complex semantic search Search the following phrases in SS, Google, and Baidu respectively.

[0102] Chinese Valentine's Day gift

[0103] Number of New Year's Day holidays

[0104] Zhang Yimou's latest movie

[0105] Academic Affairs Office of Lanzhou University, Gansu Province

[0106] color of sodium carbonate

[0107] Causes of Teeth Grinding While Sleeping

[0108] Universities in Western China

[0109] Origin of Oolong Tea

[0110] Long...

example 3

[0113] Example 3. Average results of question search

[0114] Search in SS, Google, Baidu for the following phrases

[0115] How did the Spring Festival come about?

[0116] how to calculate energy band

[0117] Is there a flight from Lanzhou to Xi'an?

[0118] What is the principle of firefly light?

[0119] How can I use gas safely?

[0120] Why are college students not reused?

[0121] How many days are New Year's Day legal holidays?

[0122] Who is the director of Memoirs of a Geisha?

[0123] The result is as Figure 8 :

[0124] summary:

[0125] □Search results with complex semantics

[0126] ■The position of the first best result (the smaller the better)

[0127] □SS

[0128] The number of good results among the top ten results (bigger is better)

[0129] □SS>Google>Baidu

[0130] The number of good results among the top twenty results (bigger is better)

[0131] □SS>Baidu>Google

[0132] □Search results for questions

[0133] ■The positio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method for improving a search engine which is based on keyword index by using phrase indexing technology; after receiving the user query sent by users, the invention first preprocesses the query, then sends the query to a query analysis module, an interface of the search engine and a web page data processing module respectively, and the generation of phrase can be fulfilled by the query analysis module respectively; the interface of the search engine and the web page data processing module take web page data from a traditional search engine, carry out data processing of web text so as to generate inverted lists; then the phrase generated by the query analysis module is subject to retrieval and matching in the inverted lists, which are obtained from the interface of the search engine and the web page data processing module, through a retrieval ordering module, meanwhile, based on an original ordering given by the search engine, the invention can carry out regulation on the original ordering according to the matching degree of the phrase; finally, the final result is returned to a client and the automatic summarization of the web page can be given simultaneously. The invention has higher sequencing scientificity.

Description

technical field [0001] The invention is a back-end processing technology of a general search engine realized by using the phrase index technology, which helps users obtain more desired results by reasonably screening and sorting the original search results. Background technique [0002] Search engine is a tool for searching web pages and websites. It has become an indispensable part of our "network life". It is an important way for us to find information, obtain information and learn knowledge on the Internet. The basic principle of the current general search engine is through the collection program of the website or webpage (that is, a general search engine based on keyword indexing, and the database of the general search engine relies on a "network robot (Spider)" or "web spider ( Crawlers)” software, which automatically obtains a large amount of web page information content through various links on the Internet, and analyzes and organizes it according to the established r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 邓剑波戴云川詹天荣张潘高潮周波张森胡显如
Owner 新百丽鞋业(深圳)有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More