Indonesian-English cross-language retrieval based on association rule antecedents and post-translation expansion

A cross-language and rule-based technology, applied in the field of information retrieval, can solve problems such as word mismatch and query topic drift

Inactive Publication Date: 2019-03-29
GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS

AI Technical Summary

Problems solved by technology

[0006] The present invention proposes an Indonesian-English cross-language retrieval method based on association rule antecedents and post-translation expansion. It is applicable to the field of cross-language information retrieval, improves retrieval performance, and addresses the query topic drift and word mismatch problems in cross-language information retrieval.

Embodiment Construction

[0056] In order to better illustrate the technical solution of the present invention, the relevant concepts involved in the present invention are introduced as follows:

[0057] 1. Antecedent and consequent of a feature word association rule: Let x and y be any feature word item sets. An implication of the form x → y is called a feature word association rule, where x is called the antecedent of the rule and y is called the consequent of the rule.
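To make the x → y notation concrete, the following minimal Python sketch represents a rule as a pair of word sets. The `Rule` class, the `confidence` helper, and the toy documents are illustrative assumptions, not part of the patent text. The confidence shown is the textbook unweighted form, support(x ∪ y) / support(x), and not the weighted measure of the invention, which this excerpt does not define.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """A feature word association rule x -> y (hypothetical helper type)."""
    antecedent: frozenset  # x, the antecedent item set
    consequent: frozenset  # y, the consequent item set


def confidence(rule, documents):
    """Textbook (unweighted) confidence: count(x and y) / count(x).

    Assumption: each document is modeled as a set of feature words.
    """
    both = rule.antecedent | rule.consequent
    n_x = sum(1 for doc in documents if rule.antecedent <= doc)
    n_xy = sum(1 for doc in documents if both <= doc)
    return n_xy / n_x if n_x else 0.0


# Toy documents, invented for illustration only.
docs = [
    {"retrieval", "query", "translation"},
    {"retrieval", "query"},
    {"query", "index"},
]
rule = Rule(frozenset({"retrieval"}), frozenset({"query"}))
print(confidence(rule, docs))  # every document with "retrieval" also has "query"
```

Here "retrieval" is the antecedent and "query" is the consequent; swapping the two gives a different rule with a different confidence.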

[0058] 2. Antecedent expansion:

[0059] Antecedent expansion refers to obtaining expansion words from the antecedent item sets of weighted association rules whose consequents are post-translation query term sets. Specifically, the feature word association rules whose consequents are the translated original query item set are selected, the antecedents of these rules are extracted as expansion words, the expansion words and the translated original query terms are combined into a new query, and the English documents are retrieved...
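The antecedent expansion step can be sketched as follows. This is a minimal illustration under stated assumptions: rules are given as (antecedent, consequent) tuples of words, a rule qualifies when its consequent is a non-empty subset of the translated query terms, and the function name `expand_query` and the sample words are hypothetical. Rule weights and ranking of expansion words are omitted because the excerpt does not specify them.

```python
def expand_query(translated_query, rules):
    """Append antecedent words of qualifying rules to the translated query.

    translated_query: list of English query terms after translation.
    rules: iterable of (antecedent, consequent) tuples of words.
    Assumption: a rule qualifies when its consequent is a non-empty
    subset of the translated query terms.
    """
    query = list(dict.fromkeys(translated_query))  # de-duplicate, keep order
    query_terms = set(query)
    expansion = []
    for antecedent, consequent in rules:
        if consequent and set(consequent) <= query_terms:
            for word in antecedent:
                if word not in query_terms and word not in expansion:
                    expansion.append(word)
    return query + expansion


# Toy rules, invented for illustration only.
rules = [
    (("tsunami",), ("earthquake",)),
    (("relief", "aid"), ("earthquake", "indonesia")),
    (("football",), ("league",)),  # consequent not in the query: ignored
]
print(expand_query(["earthquake", "indonesia"], rules))
# → ['earthquake', 'indonesia', 'tsunami', 'relief', 'aid']
```

The returned list is the new query: the original translated terms first, followed by the expansion words drawn from rule antecedents.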

Abstract

The invention discloses an Indonesian-English cross-language retrieval method based on association rule antecedents and post-translation expansion. First, machine translation is used to translate the Indonesian query into English, the English documents are retrieved, and a user-relevant document set is constructed. Then, the weight and frequency of each item set are fused with the total weight and the total number of documents of the user-relevant document set to mine frequent item sets of feature words, and the feature word item sets are pruned by item weight ranking. Next, a confidence-correlation coefficient evaluation framework is used to mine weighted association rules from the frequent feature word item sets. The antecedents of the weighted association rules whose consequents are the original query terms are extracted as expansion words, and these antecedent expansion words are combined with the original post-translation query terms to form a new post-translation query, which is used to retrieve the English documents again and obtain the final English result documents. Finally, the final English result documents are translated into Indonesian by machine translation and returned to the user. The invention uses weighted association rule antecedent expansion to mine expansion words related to the original query and improves cross-language information retrieval performance.
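The order of the steps in the abstract can be sketched as a small orchestration function. This is only an outline of the data flow: every stage (translation, search, relevance selection, rule mining, expansion) is passed in as a callable, and all function and parameter names here are hypothetical, since the abstract does not name any concrete component or formula.

```python
def retrieve_across_languages(query_id, translate_to_en, search, pick_relevant,
                              mine_rules, expand, translate_to_id):
    """Outline of the two-pass retrieval flow described in the abstract.

    All callables are assumptions supplied by the caller:
      translate_to_en(query) -> list of English terms
      search(terms)          -> list of English documents
      pick_relevant(docs)    -> user-relevant subset of the first-pass results
      mine_rules(docs)       -> weighted association rules mined from that subset
      expand(terms, rules)   -> new query built from antecedent expansion words
      translate_to_id(doc)   -> Indonesian translation of an English document
    """
    query_en = translate_to_en(query_id)      # 1. translate the Indonesian query
    first_pass = search(query_en)             # 2. initial English retrieval
    relevant = pick_relevant(first_pass)      # 3. user-relevant document set
    rules = mine_rules(relevant)              # 4. mine weighted association rules
    new_query = expand(query_en, rules)       # 5. antecedent expansion
    final_docs = search(new_query)            # 6. second retrieval pass
    return [translate_to_id(doc) for doc in final_docs]  # 7. back to Indonesian
```

Passing the stages as parameters keeps the sketch independent of any particular translation service or search engine, none of which are identified in the source.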

Description

Technical field

[0001] The invention belongs to the field of information retrieval, and specifically concerns an Indonesian-English cross-language retrieval method based on association rule antecedents and post-translation expansion.

Background art

[0002] Cross-language information retrieval refers to retrieval technology that uses a query in one language to retrieve information resources in one or more other languages by means of machine translation tools. Indonesian-English cross-language information retrieval refers to a retrieval technique that uses Indonesian user queries to retrieve English documents. The main problems in current cross-language information retrieval technology are serious query topic drift and word mismatch, which often lead to low retrieval performance.

[0003] With the rapid development of network technology and machine translation technology, cross-language information retrieval tech...

Application Information

IPC(8): G06F17/28, G06F16/332, G06F16/335
CPC: G06F40/58
Inventor: 黄名选
Owner: GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS