Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Indonesian-Chinese cross-language retrieval method and system integrating association model and user feedback

An association mode, cross-language technology, applied in digital data information retrieval, special data processing applications, instruments, etc., can solve problems such as poor single-language retrieval performance, low cross-language retrieval performance, query topic drift, etc., to avoid serious topics. Drift problem, improved retrieval performance, effect of improved retrieval performance

Inactive Publication Date: 2019-03-15
GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Scholars from all over the world have conducted in-depth discussions and research on cross-lingual information retrieval methods and systems from different angles and directions, and have achieved rich results. However, the current problems in cross-lingual information retrieval research have not been completely resolved. One of the problems that have been solved and paid more attention to is the serious query topic drift problem in the process of cross-language information retrieval, which is more serious than single-language retrieval. The problem of word mismatch, these problems often lead to low performance of cross-language retrieval, Not as good as monolingual retrieval performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Indonesian-Chinese cross-language retrieval method and system integrating association model and user feedback
  • Indonesian-Chinese cross-language retrieval method and system integrating association model and user feedback
  • Indonesian-Chinese cross-language retrieval method and system integrating association model and user feedback

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The technical solution of the present invention will be described in further non-limiting detail below in conjunction with the embodiments and accompanying drawings.

[0055] One, in order to better illustrate the technical scheme of the present invention, the relevant concepts involved in the present invention are introduced as follows below:

[0056] Assume that the target language (TargetLanguage, TL) first-check related document set obtained after the user query is cross-language initial search and user-related feedback is TLdoc={tld 1 ,tld 2 ,...,tld n},tld i (1≦i≦n) indicates the i-th document in the target language document set TLdoc, tld j ={t 1 ,t 2 ,...,t m ,...,t p},t m (m=1,2,...,p) is called the target language feature term item (Feature-term Item, FTI), referred to as the feature item, generally composed of words, words or phrases, tld i The corresponding feature item weight set W in i ={w i1 ,w i2 ,...,w im ,...,w ip},w im tld for the i-th ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an Indonesian-Chinese cross-linguistic retrieval method and system capable of integrating an association pattern with user feedback. The method comprises the following steps: utilizing a machine translation module to translate an Indonesian user query into a Chinese query, and submitting the Chinese query to a search engine module to carry out retrieval to obtain an initial retrieval result document set; utilizing the relevant feedback information extraction module of a user clicking behavior to obtain a user feedback initial retrieval related document set; carrying out preprocessing through a document preprocessing module to obtain an initial retrieval related document database; calling an all-weighted association rule mining module to construct an all-weighted association rule base; utilizing a cross-linguistic query expansion word generation module to establish an expansion word bank; utilizing a cross-linguistic query expansion implementation module to submit a new combined query to the search engine module to obtain a final retrieval result Chinese document; and utilizing a final result display module to submit a final retrieval result to the machine translation module, translating the final retrieval result into an Indonesian document, and returning the Indonesian document to the user. By use of the method and the system, cross-linguistic retrieval performance can be effectively enhanced and improved, and the method and the system have a good practical application value and popularization prospect.

Description

technical field [0001] The invention belongs to the field of text information retrieval, and specifically relates to an Indonesian-Chinese cross-language retrieval method and system that integrates association patterns and user feedback, and is applicable to fields such as cross-language text information retrieval that uses Indonesian language to query and retrieve Chinese documents. Background technique [0002] Cross-language information retrieval refers to the technology of retrieving information resources in other languages ​​with a query in one language. The Indonesian-Chinese cross-language information retrieval method is to query and retrieve Chinese documents in Indonesian language. The Indonesian language used to express the query is called the source language, and the Chinese language of the retrieved documents is called the target language. With the increasingly close exchanges between China and ASEAN countries, the research on cross-lingual information retrieval ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/332
CPCG06F16/3326G06F16/3337
Inventor 黄名选
Owner GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products