An extension method of antecedents for Chinese-English cross-language query based on matrix weighted association rules

A matrix weighted, cross-lingual technology, applied in text database query, unstructured text data retrieval, special data processing applications, etc.

Active Publication Date: 2021-09-10
GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present invention proposes a Chinese-English cross-language query antecedent extension method based on matrix weighted association rules, which is applicable to the field of cross-language information retrieval, can effectively reduce problems such as query subject drift and word mismatch in cross-language information retrieval, and improve and Improve cross-language search performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An extension method of antecedents for Chinese-English cross-language query based on matrix weighted association rules
  • An extension method of antecedents for Chinese-English cross-language query based on matrix weighted association rules
  • An extension method of antecedents for Chinese-English cross-language query based on matrix weighted association rules

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] In order to better illustrate the technical solution of the present invention, the specific implementation manners of the present invention will be described in detail below in conjunction with the accompanying drawings, but this does not constitute a limitation to the protection scope of the claims of the present invention.

[0044] The relevant concepts involved in the present invention are introduced as follows:

[0045] 1. Chinese-English cross-language query post-translation antecedent expansion

[0046] From the top related English documents in the preliminary search results of Chinese-English cross-language retrieval, those consequences are the association rules of the original query terms after translation, and the antecedents of these rules are extracted as expansion words, and the expansion words are combined with the original query terms after translation to form a new query. , to retrieve English documents again in order to improve retrieval performance. Thi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese-English cross-language query antecedent extension method based on matrix weighted association rules. Firstly, by means of machine translation, the Chinese query formula is translated into English and English documents are retrieved. The user performs a correlation judgment on the first-listed English documents to obtain the preliminary check. For related English document sets, the matrix weighted association pattern support calculation method based on item frequency and weight and the matrix weighted association pattern mining method based on support degree-confidence degree-interest degree are used to mine the relevant English document set for the initial inspection. The matrix weighted association rules of the original query terms after translation, and the antecedents of these association rules are extracted as cross-language post-translation expansion words to realize the expansion of post-translation antecedents of Chinese-English cross-language queries. Experimental results show that the invention can effectively reduce the long-standing problems of serious drift of query topics and word mismatches in cross-language information retrieval, improve and improve the performance of cross-language information retrieval, and has good application value and promotion prospect.

Description

technical field [0001] The invention belongs to the field of network information retrieval, in particular to a Chinese-English cross-language query antecedent extension method based on matrix weighted association rules. Background technique [0002] With the popularization of Internet technology, network information resources with multilingual characteristics have grown rapidly and become network big data with huge economic value and research value. How to retrieve information resources in other languages ​​from big data resources with familiar query language expressions to meet more information needs, and make cross-language information retrieval technology an urgently needed technology for current network users. The process of cross-language information retrieval is more complicated than that of single-language retrieval, and the problems encountered are more serious. The main manifestations are: affected by the translation quality, the query subject drifts seriously, the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33
CPCG06F16/3332G06F16/3338
Inventor 黄名选
Owner GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products