Chinese query extension method based on deep learning and extension word mining intersection fusion

A technology of deep learning and extension methods, applied in the field of Chinese query expansion

Inactive Publication Date: 2020-11-06
GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to propose a Chinese query expansion method for the intersection fusion of deep learning and expanded word mining, and use the method in the field of information retrieval, such as actual Chinese search engines and web information retrieval systems, to improve and enhance the performance of information retrieval systems Query performance, reducing query subject drift and word mismatch problems in information retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese query extension method based on deep learning and extension word mining intersection fusion
  • Chinese query extension method based on deep learning and extension word mining intersection fusion
  • Chinese query extension method based on deep learning and extension word mining intersection fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] 1. In order to better illustrate the technical solution of the present invention, the related concepts involved in the present invention are introduced as follows:

[0059] 1. Itemset

[0060] In text mining, a text document is regarded as a transaction, each feature word in the document is called an item, the collection of feature word items is called an itemset, and the number of all items in an itemset is called the itemset length. k_itemsets refer to itemsets containing k items, where k is the length of the itemsets.

[0061] 2. Antecedents and Consequences of Association Rules

[0062] Let x and y be an arbitrary set of feature terms, and the implication in the form of x→y is called an association rule, where x is called the antecedent of the rule, and y is called the consequent of the rule.

[0063] 3. Rule expansion words

[0064] The rule expansion word means that the expansion word is derived from the association rule consequent item set, and the antecedent it...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a Chinese query extension method based on deep learning and extension word mining intersection fusion. The method comprises the following steps: carrying out word embedding semantic learning training on an initial detection document set by adopting a deep learning tool; obtaining a word embedding extension word set with rich context semantic information; then, mining an association rule mode for the initially-detected front-column pseudo-correlation feedback document set by utilizing a Copulas-theory-based pseudo-correlation feedback extension word mining method; and obtaining a rule extension word set containing feature inter-word association information based on statistical analysis, and finally embedding words into the extension word set and the rule extension word set for intersection fusion to obtain a final extension word set so as to improve the extension word quality. According to the method, deep learning and extension word mining intersection are fused,high-quality extension words related to original query are mined, the problems of query topic drifting and word mismatching can be restrained, the text information retrieval performance is improved,and the method has good application value and popularization prospects.

Description

technical field [0001] The invention relates to a Chinese query expansion method integrating the intersection of deep learning and expanded word mining, and belongs to the technical field of information retrieval. Background technique [0002] Query expansion is one of the key technologies to solve the problem of query subject drift and word mismatch in information retrieval. Query expansion refers to modifying the original query weight or adding words related to the original query to obtain a new query that is longer than the original query. , in order to more completely and accurately describe the semantics or themes implied by the original query, make up for the lack of user query information, and improve the retrieval performance of the information retrieval system. The core problem of query expansion is the source of expansion words and the design of expansion model. With the development of network technology and the arrival of the era of big data, network users have m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/332
CPCG06F16/3325G06F16/3334G06F16/3335G06F16/3338G06F16/334
Inventor 黄名选
Owner GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products