Automatic error correction method for Chinese search term of search engine

An automatic error correction and search engine technology, applied in network data indexing, network data retrieval, and other database retrieval. It addresses the problems that a small change can alter a word's meaning, that correction carries a high risk of changing the intended meaning, and that query error correction is technically complex. It achieves a wide error correction range, a high success rate, and applicability to many error correction scenarios.

Status: Inactive
Publication Date: 2016-11-09
DATAGRAND TECH INC
Cites: 3. Cited by: 20
AI Technical Summary

Problems solved by technology

Chinese also has problems similar to real-word errors. For example, in "salary increase imperial decree", "salary increase" and "imperial decree" are each correct words, but the two together are wrong. In many cases, Chinese query error correction is therefore really a phrase-level error correction problem.
[0007] 2. Query error correction technology is complicated:
In addition, since Chinese words are often short, a difference of a single character may completely change the meaning of a word, so relying on a single error correction method such as edit distance often carries a high risk of changing the intended meaning of the query.
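The risk described above can be illustrated with a small sketch. It uses the standard Levenshtein edit distance; the word pairs are illustrative assumptions and do not come from the patent. A single-character difference in a two-character Chinese word is a distance of 1, yet it covers half the word and yields a different, equally valid word.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def relative_distance(a: str, b: str) -> float:
    """Edit distance divided by the longer length (0.0 for two empty strings)."""
    longest = max(len(a), len(b))
    return edit_distance(a, b) / longest if longest else 0.0


# Illustrative pairs (assumptions, not taken from the patent).
# "银行" (bank) and "银河" (galaxy) are both valid words, one character apart.
print(edit_distance("银行", "银河"), relative_distance("银行", "银河"))
# A typo in a longer English word changes a much smaller share of the word.
print(edit_distance("retrieval", "retreival"),
      relative_distance("retrieval", "retreival"))
```

A corrector that accepts any dictionary word within distance 1 would treat "银行" and "银河" as interchangeable, which is why edit distance alone is a weak signal for short Chinese words.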


Examples

Detailed Description of Embodiments

[0036] The present invention will be further described below in conjunction with the accompanying drawings.

[0037] As shown in Figure 2, the automatic error correction method for Chinese search words of a search engine includes a data module, an offline database-building end, and an online retrieval end. The main function of the data module is to provide data for the offline database-building end and the online retrieval end.

[0038] The automatic error correction method for Chinese search words of a search engine proceeds as follows:

[0039] As shown in Figure 3, the data module periodically extracts and aggregates the search logs to produce query frequency information, segments each query into words, and computes the df and idf of each word. It also organizes the database information, crawls entries, tags, and similar items from high-quality websites through the crawler system, and compiles high-quality dictionaries from existing NLP data; there are four main s...
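The log statistics named in this step (query frequency, df, idf) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: `segment` is a hypothetical placeholder that splits on whitespace (a real system would use a Chinese word segmenter), each distinct query is treated as one "document" for df, and idf is taken as log(N / df).

```python
import math
from collections import Counter
from typing import Dict, Iterable, List, Tuple


def segment(query: str) -> List[str]:
    """Hypothetical placeholder segmenter: splits on whitespace.
    A production system would use a real Chinese word segmenter."""
    return query.split()


def build_log_stats(
    log: Iterable[str],
) -> Tuple[Counter, Counter, Dict[str, float]]:
    """Return (query frequency, word df, word idf) for a search log.

    Assumptions: each distinct query counts as one document, and
    idf(word) = log(N / df(word)) with N the number of distinct queries.
    """
    query_freq = Counter(log)
    df: Counter = Counter()
    for query in query_freq:
        # set() so a word repeated within one query is counted once
        df.update(set(segment(query)))
    n = len(query_freq)
    idf = {word: math.log(n / count) for word, count in df.items()}
    return query_freq, df, idf


# Illustrative log (made-up data, pre-segmented with spaces).
sample_log = ["北京 天气", "北京 酒店", "北京 天气"]
query_freq, df, idf = build_log_stats(sample_log)
print(query_freq.most_common(1))
print(df["北京"], idf["北京"], idf["天气"])
```

Words that appear in nearly every query get an idf near zero, while rare words score higher.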

Abstract

The invention relates to an automatic error correction method for Chinese search terms of a search engine and belongs to the technical field of computer applications. The method comprises a data module, an offline database construction end, and an online retrieval end; the data module mainly provides data for the offline database construction end and the online retrieval end. Abundant offline data are mined using modules such as the search log and the crawler system, and are used by the various error correction strategies. For different fields, a field-specific dictionary is used for error correction. The method combines multiple independent error correction strategies, which supplement and are compared against each other for complex query errors, so that a good result is finally achieved. Because a second round of error correction is employed, the error correction range is wide and the success rate is high. The error correction strategies can be configured flexibly and independently, so the method applies to a wide range of error correction scenarios and can adapt to various vertical search fields.
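One way to picture the combination of independent, configurable strategies is the sketch below. Everything in it is an assumption made for illustration: the `Strategy` signature, the confidence scores, the dictionary contents, and the reading of "second round" as re-running the strategies on the first round's output. The abstract does not specify these details.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical types: a strategy returns (corrected query, confidence) or None.
Candidate = Tuple[str, float]
Strategy = Callable[[str], Optional[Candidate]]


def dictionary_strategy(mapping: Dict[str, str], score: float) -> Strategy:
    """Build a strategy from a lookup table, e.g. a field-specific dictionary."""
    def strategy(query: str) -> Optional[Candidate]:
        fixed = mapping.get(query)
        return (fixed, score) if fixed is not None else None
    return strategy


def correct_once(query: str, strategies: List[Strategy]) -> str:
    """Run every configured strategy independently; keep the best-scored candidate."""
    best: Optional[Candidate] = None
    for strategy in strategies:
        candidate = strategy(query)
        if candidate is not None and (best is None or candidate[1] > best[1]):
            best = candidate
    return best[0] if best is not None else query


def correct(query: str, strategies: List[Strategy], rounds: int = 2) -> str:
    """Apply up to `rounds` correction rounds, stopping once nothing changes."""
    for _ in range(rounds):
        fixed = correct_once(query, strategies)
        if fixed == query:
            break
        query = fixed
    return query


# Made-up dictionaries: one general, one for a hypothetical vertical field.
general = dictionary_strategy({"pyhton tutorail": "python tutorail"}, score=0.6)
domain = dictionary_strategy({"python tutorail": "python tutorial"}, score=0.9)
print(correct("pyhton tutorail", [general, domain]))
```

Because each strategy is just a callable in a list, a deployment for one vertical field can enable, disable, or reorder strategies without touching the others.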

Description

Technical field

[0001] The invention relates to an automatic error correction method for Chinese search words of a search engine, and belongs to the technical field of computer applications.

Background

[0002] Today, search engines are one of the most important ways for people to obtain information. The most basic and core function of a search engine system is information retrieval: finding webpages or documents that contain the keywords, and then returning the results in a certain order. In a search engine, the keyword information entered by the user is called the query, and the user hopes to get better-quality web pages or documents related to that query. There are many ways to measure "better"; the simplest standard is that the results most helpful and attractive to users are ranked first. However, for various reasons, the query entered by the user may itself be of low quality or wrong. If the search engine does not correct and m...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
CPC: G06F16/951, G06F40/216, G06F40/289
Inventor: 高翔
Owner: DATAGRAND TECH INC