Indonesian-English cross-language retrieval method based on weighted association rule postpart mining

A weighted association and cross-language technology, applied in the field of information retrieval, can solve problems such as word mismatch and query topic drift, and achieve the effect of improving performance and solving query topic drift and word mismatch problems

Inactive Publication Date: 2019-04-05
GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The invention proposes an Indonesian-English cross-language retrieval method based on weighted association rule consequential mining, which is suitable for fields such as cross-language information retrieval and search engines, can improve the performance of cross-language information retrieval, and solve query topics in cross-language information retrieval Drift and word mismatch issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Indonesian-English cross-language retrieval method based on weighted association rule postpart mining
  • Indonesian-English cross-language retrieval method based on weighted association rule postpart mining
  • Indonesian-English cross-language retrieval method based on weighted association rule postpart mining

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] In order to better illustrate the technical solution of the present invention, the relevant concepts involved in the present invention are introduced as follows:

[0058] 1. Antecedents and postconditions of feature word association rules: Let x and y be any set of feature word items, and the implication of the form x → y is called feature word association rule, where x is called the antecedent of the rule, y is called the consequent of the rule.

[0059] 2. Postpart extension:

[0060] The extension of the consequent means that the expanded words come from the consequent item set of the weighted association rule, and the antecedent of the weighted association rule must be a post-translated query term set.

[0061] 3. Feature word item set support

[0062] Assume that the cross-language preliminary inspection related document set is composed of d 1 , d 2 ,...,d n and other documents, and the feature words of each document are expressed as t 1 ,t 2 ,...,t m , and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an Indonesian-English cross-language retrieval method based on weighted association rule postpart mining which comprises the following steps: translating an Indonesian user query machine into English and retrieving English documents, and constructing an initial examination related document set; Combining the translated original query lexical items, fusing the weight and frequency of the item set with the total weight of the feature words and the total number of the documents of the initially-detected user related feedback English document set, and adopting support degree-to-support degree-to-support degree-to-support degree; adopting a Confidence- correlation coefficient evaluation framework for mining feature word weighting association rules of original query lexical items of which the precursors are translated from the initial inspection correlation document set, extracting the parts after the weighting association rules as translated extension words, and combining the extension words with the translated original query lexical items to form new queries to retrieve English documents again; And translating the final retrieval result English document into anIndonesian language file and returning the Indonesian language file to the user; mining Extended words related to original query, pruning the feature word candidate item set through item weight sorting, the mining efficiency is improved, and Indonesian-is improved and promoted. And the English cross-language information retrieval performance is realized.

Description

technical field [0001] The invention belongs to the field of information retrieval, in particular to an Indonesian-English cross-language retrieval method based on weighted association rule consequence mining. Background technique [0002] Cross-language information retrieval refers to the retrieval technology of retrieving information resources in another language or multiple languages ​​with a query in one language by means of machine translation tools. Indonesian-English cross-language information retrieval refers to searching English documents with Indonesian user queries. With the rapid development of network technology and machine translation technology, cross-language information retrieval technology has been widely concerned and discussed. Scholars have conducted in-depth discussions and research on cross-language information retrieval models and algorithms from different angles and directions, and achieved great achievements. Rich results, however, current research...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332
Inventor 黄名选
Owner GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products