Unlock instant, AI-driven research and patent intelligence for your innovation.

Systems, methods, software, and interfaces for multilingual information retrieval

A language and target language technology, applied in the field of information retrieval, which can solve the problems of complex effective search, impact on meaning, interference, etc.

Inactive Publication Date: 2009-05-27
THOMSON REUTERS ENTERPRISE CENT GMBH
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In such cases, the challenge of effective search is compounded because in non-English languages, nouns can be masculine, feminine, or neuter; verbs change forms to indicate numbers (single or multiple), display tense (present, past, future, etc.), and revealing persons (first-person "I," second-person "you," and third-person "he / she / it"); adjectives change form according to the noun they modify; and punctuation marks ( such as grave accents or other diacritics) greatly affect the meaning
While stemming addresses these complexities in monolingual search, stemming by itself does not address the added complexity of linguistic conflicts between languages, and can even interfere in some cases.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Systems, methods, software, and interfaces for multilingual information retrieval
  • Systems, methods, software, and interfaces for multilingual information retrieval

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0011] This specification describes one or more specific embodiments of the present invention with reference to and in conjunction with the above-mentioned drawings. These examples are provided not to limit the invention, but to illustrate and teach the invention, shown and described in sufficient detail to enable any person skilled in the art to make or practice the invention. In order to avoid unnecessarily obscuring the present invention, the description may omit certain information well known to those skilled in the art.

[0012] Demonstrative Multilingual Information Retrieval System

[0013] figure 1 An exemplary online multilingual information retrieval system 100 is shown that incorporates the teachings of the present invention. System 100 includes one or more databases 110 , one or more servers 120 , and one or more access devices 130 .

[0014] Database 110 includes a set of multilingual documents 112 and a corresponding set of monolingual indexes 114 .

[0015] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present inventors have a devised one or more novel methods, systems, and interfaces for facilitating multi-lingual searches. One exemplary method entails creating multiple language-specific indices for a collection of documents, with each index including stemmed and non-stemmed versions of terms from the documents. Users submit queries that are associated with a set of one or more target languages. Query processing entails translating original and stemmed versions of each term in a query into each of the target languages, using one or more techniques that each yield a set of potentially equivalent query terms. Each set of potentially equivalent query terms is then processed against the corresponding language-specific index, using a conventional monolingual search technique, such as a Boolean or natural language query, to identify documents from the collection. The resultant documents are presented to the user in language groupings or by computed relevance.

Description

technical field [0001] This application claims priority to US Provisional Application 60 / 641,669, filed January 4, 2005, which is hereby incorporated by reference. [0002] Various embodiments of the present invention relate to information retrieval, particularly multilingual or cross-lingual information retrieval systems, methods and software. Background technique [0003] The importance of search engine technology has grown considerably over the past decade or so, reflected in the expansion and use of the Internet. When a user clicks the search button, the search engine searches through tens of millions of items to find an item and corresponding document that satisfies the query. However, this superficial simplicity conceals the complexity of the underlying search technology, because a good search engine generally does not stop at simple matching of query items. [0004] To understand this complexity, consider that search engines generally fall into two categories: monol...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F17/30616G06F17/30864G06F17/30669G06F16/313G06F16/3337G06F16/951
Inventor I·穆利尼耶E·S·伦德
Owner THOMSON REUTERS ENTERPRISE CENT GMBH