Indonesian-Chinese cross-linguistic retrieval method and system capable of integrating association pattern with user feedback

An association-pattern-based cross-language retrieval technology, applied in special data processing applications, instruments, electronic digital data processing, etc. It addresses the problems of low cross-language retrieval performance (inferior to monolingual retrieval performance) and query topic drift, avoiding severe topic drift and improving retrieval performance.

Status: Inactive
Publication Date: 2017-03-08
GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
Cites: 3; Cited by: 8

AI Technical Summary

Problems solved by technology

[0003] Scholars from all over the world have conducted in-depth discussions and research on cross-lingual information retrieval methods and systems from different angles and directions, and have achieved rich results. However, the current problems in cross-lingual information retrieval research have not been completely resolved. One of the problems


Embodiment Construction

[0054] The technical solution of the present invention will be described in further non-limiting detail below in conjunction with the embodiments and accompanying drawings.

[0055] 1. To better illustrate the technical scheme of the present invention, the relevant concepts are introduced below:

[0056] Assume that the target-language (Target Language, TL) initial-retrieval relevant document set, obtained from the user query after the initial cross-language retrieval and user relevance feedback, is $TLdoc = \{tld_1, tld_2, \ldots, tld_n\}$, where $tld_i$ ($1 \le i \le n$) denotes the $i$-th document in the target-language document set $TLdoc$. Each document is $tld_i = \{t_1, t_2, \ldots, t_m, \ldots, t_p\}$, where $t_m$ ($m = 1, 2, \ldots, p$) is called a target-language feature-term item (Feature-term Item, FTI), or feature item for short, and generally consists of characters, words, or phrases. The feature-item weight set corresponding to $tld_i$ is $W_i = \{w_{i1}, w_{i2}, \ldots, w_{im}, \ldots, w_{ip}\}$, $w_{im}$ tld...
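
As a rough illustration of the notation in [0056], the document set, feature items, and weight sets can be held in a small data structure. This is a minimal sketch and not the patent's implementation: the class name `TargetDocument`, the helper `feature_vocabulary`, and the sample terms and weight values are all hypothetical. The source text is truncated before it defines how the weights are computed, so the weights here are simply supplied by the caller.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class TargetDocument:
    """One document tld_i: maps each feature item t_m to its weight w_im."""
    doc_id: str
    weights: Dict[str, float]  # hypothetical values; the weighting formula is not given

    @property
    def feature_items(self) -> List[str]:
        # The feature items of tld_i, in insertion order.
        return list(self.weights)


def feature_vocabulary(tl_doc: List[TargetDocument]) -> List[str]:
    """Sorted union of the feature items over all documents in TLdoc."""
    vocab = set()
    for doc in tl_doc:
        vocab.update(doc.weights)
    return sorted(vocab)


# TLdoc with n = 2 documents; terms and weights are made up for illustration.
tl_doc = [
    TargetDocument("tld_1", {"检索": 0.8, "查询": 0.5}),
    TargetDocument("tld_2", {"查询": 0.6, "扩展": 0.9}),
]
```

Keeping the weights in a per-document mapping means a feature item and its weight cannot drift out of alignment, which a pair of parallel lists would allow.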

Abstract

The invention discloses an Indonesian-Chinese cross-language retrieval method and system that integrates association patterns with user feedback. The method comprises the following steps: a machine translation module translates the Indonesian user query into a Chinese query, which is submitted to a search engine module to obtain an initial retrieval result document set; a relevance feedback extraction module based on user click behavior obtains the user-feedback initial-retrieval relevant document set; a document preprocessing module preprocesses these documents to build an initial-retrieval relevant document database; an all-weighted association rule mining module is called to construct an all-weighted association rule base; a cross-language query expansion word generation module builds an expansion word bank; a cross-language query expansion implementation module submits the new combined query to the search engine module to obtain the final retrieved Chinese documents; and a final result display module submits the final retrieval results to the machine translation module, which translates them into Indonesian documents and returns them to the user. The method and system can effectively improve cross-language retrieval performance and have good practical application value and popularization prospects.
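
The control flow of the steps in the abstract can be sketched as a single function that wires the modules together. This is only an outline under stated assumptions: every callable parameter (`translate_id_to_zh`, `search`, `clicked_by_user`, `expansion_terms`, and so on) is a hypothetical stand-in for a module the abstract names, not a real API, and `expansion_terms` collapses preprocessing, all-weighted association rule mining, and expansion word generation into one step because the source does not give their details here. Joining the query and expansion words with spaces is likewise an assumption.

```python
from typing import Callable, List, Sequence


def cross_language_retrieve(
    query_id: str,
    translate_id_to_zh: Callable[[str], str],              # machine translation module
    translate_zh_to_id: Callable[[str], str],              # same module, reverse direction
    search: Callable[[str], List[str]],                    # search engine module
    clicked_by_user: Callable[[List[str]], List[str]],     # click-based relevance feedback
    expansion_terms: Callable[[List[str]], Sequence[str]], # preprocessing + mining + word bank
) -> List[str]:
    """Run one Indonesian query through the pipeline and return Indonesian documents."""
    query_zh = translate_id_to_zh(query_id)      # Indonesian query -> Chinese query
    initial_docs = search(query_zh)              # initial retrieval result set
    relevant_docs = clicked_by_user(initial_docs)  # user-feedback relevant set
    terms = expansion_terms(relevant_docs)       # expansion words from the relevant set
    new_query = " ".join([query_zh, *terms])     # new combined query (assumed format)
    final_docs_zh = search(new_query)            # final retrieved Chinese documents
    return [translate_zh_to_id(doc) for doc in final_docs_zh]
```

Passing the modules in as callables keeps the sketch independent of any particular translator or search engine, and lets each stage be replaced by a stub when checking the flow.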

Description

Technical field

[0001] The invention belongs to the field of text information retrieval and specifically relates to an Indonesian-Chinese cross-language retrieval method and system that integrates association patterns and user feedback. It is applicable to fields such as cross-language text information retrieval in which Indonesian queries are used to retrieve Chinese documents.

Background technique

[0002] Cross-language information retrieval refers to retrieving information resources in other languages with a query in one language. Indonesian-Chinese cross-language information retrieval uses Indonesian queries to retrieve Chinese documents: the Indonesian in which the query is expressed is called the source language, and the Chinese of the retrieved documents is called the target language. With the increasingly close exchanges between China and ASEAN countries, the research on cross-lingual information retrieval ...


Application Information

IPC(8): G06F17/30
CPC: G06F16/3326, G06F16/3337
Inventor: 黄名选
Owner: GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS