Machine translation for query expansion

A statistical machine translation and translation technology, applied in the field of search query expansion, which can solve the problems of search result identification, irrelevant shipping boxes, etc.

Inactive Publication Date: 2010-11-03
GOOGLE LLC
View PDF0 Cites 37 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Expanding queries with synonyms that are inconsistent with the user's intended meaning may lead to the identification of irrelevant search results
For example, a search result for a fishing trawler might not be relevant for shipping boxes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Machine translation for query expansion
  • Machine translation for query expansion
  • Machine translation for query expansion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] figure 1 is a diagram of an example statistical machine translation system 100 . Statistical machine translation system 100 is used to translate a sequence of input words in a source language into a sequence of translated words in a target language. Statistical machine translation depends on statistical models based on prior probabilities and statistical correlations between occurrences of words in a training corpus. Conventional applications of statistical machine translation assume that both the source and target languages ​​are different natural languages ​​(eg, French, English, German, or Arabic). In principle, however, the natural language used as input and the natural language provided as output need not be different.

[0020] Statistical machine translation system 100 includes two distinct models: language model 117 and translation model 113 . Language model 117 is used in machine translation to determine whether a passage of text is likely to be in a target l...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Methods, systems and apparatus, including computer program products, for expanding search queries. One method includes receiving a search query, selecting a synonym of a term in the search query based on a context of occurrence of the term in the received search query, the synonym having been derived from statistical machine translation of the term, and expanding the received search query with the synonym and using the expanded search query to search a collection of documents. Alternatively, another method includes receiving a request to search a corpus of documents, the request specifying a search query, using statistical machine translation to translate the specified search query into an expanded search query, the specified search query and the expanded search query being in the same natural language, and in response to the request, using the expanded search query to search a collection of documents.

Description

technical field [0001] This specification deals with search query expansion. Background technique [0002] Query expansion refers to modifying the search query received from the user before performing the search. Ideally, the modified search query will produce improved search results compared to the original query. Typical methods for query expansion include stemming of words, correction of misspellings, and augmentation of search queries, such as using synonyms of words that appeared in the original query. [0003] There are many methods of query expansion using synonyms. For example, synonyms of words can be identified from expert-specified thesaurus or lexical ontologies. In some systems, synonyms are identified from other search queries that are syntactically similar to the original query. Synonym selection is especially challenging when a word may have multiple potential synonyms, each with widely varying meanings. For example, in the query "How to ship a box", the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28
CPCG06F17/30672G06F16/3338
Inventor 斯特凡·里茨勒亚历山大·L·瓦谢尔曼
Owner GOOGLE LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products