Unlock instant, AI-driven research and patent intelligence for your innovation.

Indonesian-English cross-language post-translation part extension method based on item weight sorting mining

An extension method and item weight technology, which is applied in the field of information retrieval and can solve problems such as word mismatch and query topic drift.

Inactive Publication Date: 2019-04-05
GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The invention proposes an Indonesian-English cross-language post-translation extension method based on item weight sorting and mining, which is applied to the field of cross-language information retrieval, applied to actual cross-language search engines and cross-language information retrieval systems, and improves cross-language information Retrieval performance, solving the problem of query subject drift and word mismatch in cross-language information retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Indonesian-English cross-language post-translation part extension method based on item weight sorting mining
  • Indonesian-English cross-language post-translation part extension method based on item weight sorting mining
  • Indonesian-English cross-language post-translation part extension method based on item weight sorting mining

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] In order to better illustrate the technical solution of the present invention, the relevant concepts involved in the present invention are introduced as follows:

[0057] 1. Antecedents and postconditions of feature word association rules: Let x and y be any set of feature word items, and the implication of the form x → y is called feature word association rule, where x is called the antecedent of the rule, y is called the consequent of the rule.

[0058] 2. Indonesian-English cross-language post-translation extension:

[0059] From the Indonesian-English cross-language retrieval preliminary results of the first related English documents, the antecedents are the English feature word association rules of the original query item set after translation, and the consequents of these rules are extracted as English expansion words, and the English expansion words and After translation, the original English query terms are combined into a new query, and the English documents are...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an Indonesian-based on item weight sorting mining. The invention discloses an English cross-language translated postpartum extension method. The method comprises the followingsteps: firstly, retrieving an English document for Indonesian language query in a cross-language manner; constructing an initially-detected user-related feedback English document set; then, in combination with the translated English original query lexical item, fusing the weight and frequency of the item set with the total weight of the feature words of the English document set fed back by the initial detection user and the total number of the documents; adopting A support degree-confidence coefficient-correlation coefficient evaluation framework to mine a feature word weighted association rule, wherein a feature word weighted association rule front part is an original query word item set, a feature word weighted association rule rear part is composed of non-query word items, the feature word weighted association rule rear part is extracted to serve as a post-translation expansion word, and Indonesia-correlation is achieved. And expanding the translated part of the English cross-language query. According to the method, the item set is pruned through item weight sorting, the mining efficiency is improved, the extension words related to the original query can be mined, and Indonesian-is achieved. British cross-language translated postpartum extension to improve and improve Indonesian- And the English cross-language information retrieval performance is realized.

Description

technical field [0001] The invention belongs to the field of information retrieval, in particular to an Indonesian-English cross-language post-translation extension method based on item weight sorting and mining. Background technique [0002] Cross-language query expansion refers to the process of using a certain strategy to find the expansion words related to the original query in the process of cross-language information retrieval, and then combining the expansion words and the original query to obtain a new query and re-retrieval. Cross-language query expansion is one of the key technologies to enhance and improve the performance of cross-language information retrieval, and it can solve the long-term problems in cross-language information retrieval, such as severe drift of query topics and word mismatch. According to different stages of cross-language information retrieval, cross-language query expansion can be divided into three types: query pre-translation expansion, qu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/28G06F16/332G06F16/335
CPCG06F40/58
Inventor 黄名选
Owner GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS