Method and device for mining synonyms

A technology of synonyms and synonym sets, applied in the computer field, can solve problems such as low efficiency

Active Publication Date: 2012-10-31
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF0 Cites 39 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The existing synonym mining method performs synonym mining by calculating the correlation probability between words in the corpus, but this method needs to calculate the words in the corpus in pairs, and the efficiency is very low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for mining synonyms
  • Method and device for mining synonyms
  • Method and device for mining synonyms

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0086] figure 1 It is a flow chart of the method provided by Embodiment 1 of the present invention. The method shown in this embodiment can be executed offline in the background by the server where the search engine is located, such as figure 1 As shown, the method may include the following steps:

[0087] Step 101: From the search log, the query and the title of the webpage clicked or browsed in the corresponding search results, or different queries corresponding to the title of the webpage clicked or browsed, obtain candidate resources of synonyms.

[0088] When a user enters a query and clicks or browses in the search results, usually the query and the title of the clicked or browsed web page will have a semantic relationship or even consistency, and the title of the clicked or browsed web page corresponding to the same query will It may also be semantically related or even consistent.

[0089] Furthermore, if different users input different queries, or the same user ente...

Embodiment 2

[0147] figure 2 The structural diagram of the mining device for synonyms provided in Embodiment 2 of the present invention, the device can be set on the server side where the search engine is located, such as figure 2 As shown, the apparatus may include: a candidate resource acquisition unit 200 and a synonym extraction unit 210 .

[0148] The candidate resource acquiring unit 200 acquires candidate resources of synonyms formed by phrase pairs from the search log, the title of the web page clicked or browsed in the query and its corresponding search results, or different queries corresponding to the title of the web page clicked or browsed .

[0149] Wherein, the candidate resource acquisition unit 200 may acquire candidate resources in any of the following ways or a combination of any ways:

[0150] Obtain the clicked or browsed webpage title in the search results corresponding to the query from the search log, and obtain the phrase pair (that is, the "query-title" pair) ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and device for mining synonyms. The method comprises the following steps of: searching a query and a webpage title which is clicked or browsed in a searching result corresponding to the query from a searching log, or allowing the clicked or browsed webpage title to correspond with different queries, and acquiring a candidate resource of a synonym formed by phrase pairs; and extracting a synonym from each phrase pair of the candidate resource, wherein the extracted synonym pair has the same context from the phrase pairs. According to the method and device for mining synonyms, the efficiency and accuracy of mining synonyms can be improved, and the mined synonyms can be in more accordance with the language characteristics of a search engine.

Description

【Technical field】 [0001] The invention relates to the field of computer technology, in particular to a method and device for mining synonyms. 【Background technique】 [0002] When a user is using a search engine to search, in order to recall webpages that match the synonyms of the query entered by the user in the search results, a synonym-based search request (query) extension will be used, that is, when using query While searching, the synonyms of query are also used for searching. In order to apply this technology in search engines, the mining of synonyms is a very important basic work. [0003] Existing synonym mining methods perform synonym mining by calculating the correlation probability between words in the corpus, but this method needs to calculate pairs of words in the corpus, and the efficiency is very low. 【Content of invention】 [0004] In view of this, the present invention provides a method and device for mining synonyms, so as to improve the efficiency of m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 徐文智赵世奇呼大为
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products