
Post-translation expansion method for Chinese-English cross-language query based on fully weighted rule consequent

A fully weighted association-rule technique for cross-language information retrieval that addresses problems such as query topic drift and word mismatch.

Active Publication Date: 2021-09-10
GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS

AI Technical Summary

Problems solved by technology

[0003] To address the problems in the prior art described above, the present invention proposes a post-translation expansion method for Chinese-English cross-language queries based on fully weighted rule consequents. The method can improve cross-language retrieval performance and reduce query topic drift and word mismatch in cross-language information retrieval. It is applicable to the field of cross-language information retrieval and can also be applied to cross-language search engines to improve their retrieval performance.



Embodiment Construction

[0041] To better illustrate the technical solution of the present invention, specific embodiments are described in detail below with reference to the accompanying drawings. This description does not limit the scope of protection of the claims of the present invention.

[0042] The relevant concepts involved in the present invention are introduced as follows:

[0043] 1. Antecedents and consequents of association rules: An implication of the form x→y is called an association rule, where x is called the antecedent of the rule and y is called the consequent of the rule.
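The antecedent/consequent terminology can be illustrated with a minimal Python sketch. The `Rule` class, the helper functions, and the toy document set are all hypothetical names introduced here for illustration. The measures shown are the classical, unweighted support and confidence, not the fully weighted measures of the invention.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """An association rule x -> y (hypothetical illustration class)."""
    antecedent: frozenset  # x, the left-hand side of the rule
    consequent: frozenset  # y, the right-hand side of the rule


def support(itemset, transactions):
    """Classical support: fraction of transactions containing every item."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(rule, transactions):
    """Classical confidence of x -> y: support(x | y) / support(x)."""
    base = support(rule.antecedent, transactions)
    if base == 0:
        return 0.0
    return support(rule.antecedent | rule.consequent, transactions) / base


# Toy "documents", each reduced to a set of feature words (assumed data).
docs = [
    frozenset({"retrieval", "query", "expansion"}),
    frozenset({"retrieval", "query"}),
    frozenset({"query", "translation"}),
    frozenset({"retrieval", "expansion"}),
]

# The rule {retrieval} -> {expansion}: antecedent x, consequent y.
rule = Rule(frozenset({"retrieval"}), frozenset({"expansion"}))
```

Here `rule.antecedent` plays the role of x and `rule.consequent` the role of y. Calling `confidence(rule, docs)` estimates how often the consequent appears in documents that contain the antecedent.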

[0044] 2. Fully weighted association pattern support that fuses item frequency and weight

[0045] In association pattern mining, the core problem is calculating the support of association patterns. The present invention proposes a formula for calculating the support, awSup(I), of a fully weighted association pattern ...


Abstract

The invention discloses a post-translation expansion method for Chinese-English cross-language queries based on fully weighted rule consequents. First, an initial Chinese-English cross-language retrieval is performed, the top-ranked English documents from that initial retrieval are extracted, and a set of relevant initial-retrieval documents is constructed through user relevance judgment and then preprocessed. Next, a fully weighted itemset support calculation that fuses itemset weight and frequency is used to mine, from this relevant document set, fully weighted frequent itemsets that contain the translated original query terms. A confidence-interest evaluation framework based on fully weighted confidence is then used to mine, from these frequent itemsets, fully weighted association rules between English feature words whose antecedents are the translated original query terms. The consequents of these rules are extracted as the Chinese-English cross-language post-translation expansion terms, and the expansion terms are combined with the translated original query terms to form a new query that retrieves the English documents again. The method can improve cross-language information retrieval performance and reduce severe query topic drift and word mismatch in cross-language information retrieval, and has high application value and broad application prospects.
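The final expansion step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented procedure: the function name `expand_query`, the `(antecedent, consequent, score)` tuple layout, the `max_terms` cutoff, and the sample rules are all hypothetical. The rule scores are taken as given, because the fully weighted support and confidence formulas are not reproduced in this text.

```python
def expand_query(query_terms, rules, max_terms=5):
    """Append rule consequents to the translated query (illustrative sketch).

    query_terms: translated (English) original query terms, in order.
    rules: iterable of (antecedent, consequent, score) tuples, where
           antecedent and consequent are sets of English feature words and
           score is a precomputed rule score (assumed to be supplied).
    """
    query = set(query_terms)
    best_score = {}
    for antecedent, consequent, score in rules:
        # Keep only rules whose antecedent consists of translated query terms.
        if antecedent and set(antecedent) <= query:
            for term in consequent:
                if term not in query:
                    best_score[term] = max(score, best_score.get(term, 0.0))
    # Rank candidates by score (descending), breaking ties alphabetically.
    ranked = sorted(best_score, key=lambda t: (-best_score[t], t))
    return list(query_terms) + ranked[:max_terms]


# Hypothetical mined rules with made-up scores.
sample_rules = [
    ({"retrieval"}, {"search", "index"}, 0.8),
    ({"query"}, {"expansion"}, 0.9),
    ({"weather"}, {"forecast"}, 0.95),      # antecedent not in the query
    ({"retrieval", "query"}, {"search"}, 0.6),
]

new_query = expand_query(["query", "retrieval"], sample_rules)
# new_query == ["query", "retrieval", "expansion", "index", "search"]
```

The new query keeps the translated original terms and adds only consequents of rules anchored on those terms, which is what ties the expansion back to the original query topic.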

Description

Technical field

[0001] The invention belongs to the field of information retrieval and in particular relates to a post-translation expansion method for Chinese-English cross-language queries based on fully weighted rule consequents.

Background

[0002] Cross-language information retrieval refers to the technology of retrieving information resources in other languages with a query in one language by means of machine translation. The language in which the user query is expressed is called the source language, and the language in which the documents are retrieved is called the target language. Cross-language information retrieval is affected by query translation quality and by synonyms and polysemous words, which often lead to severe query topic drift, word mismatch, and ambiguity in the translation of query terms. Cross-language query expansion is one of the key technologies for solving these problems. Cross-language query expansion refers to the process...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/2452
CPC: G06F16/3338
Inventor: 黄名选
Owner: GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS