Patent text retrieval method and device based on multi-modal matrix vector representation

A matrix-vector, multi-modal technology, applied in unstructured text data retrieval, neural learning methods, text database query, etc., can solve information dispersion, vector representation cannot contain keyword information, and related technologies cannot be accurately represented, etc. problems to achieve precise results

Active Publication Date: 2022-07-22
CHENGDU UNIV OF INFORMATION TECH +2
View PDF17 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This kind of vector representation is difficult to accurately represent the scattered information in the document and involves more technologies, and the combination of different word vectors on the same dimension may cause mutual cancellation, and the final vector representation cannot contain all keywords. word information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Patent text retrieval method and device based on multi-modal matrix vector representation
  • Patent text retrieval method and device based on multi-modal matrix vector representation
  • Patent text retrieval method and device based on multi-modal matrix vector representation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] The technical solutions in the embodiments of the present invention will be clearly and completely described below. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0018] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the invention.

[0019] The present invention will now be further described with reference to the accompanying drawings.

[0020] figure 1 A flow chart o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a patent text retrieval method and device based on multi-modal matrix vector representation, a word vector set of all words is obtained through training according to an existing patent data set, so that word vectors can contain information of all keywords, an image vectorization representation model is obtained through training of the existing patent data set, and the retrieval efficiency is improved. According to the method, the attached drawings in the patent are extracted, the graph vectors corresponding to the attached drawings are obtained, the graph vectors and the word vectors are combined, when the patent is retrieved, a large amount of useful information contained in the attached drawings in the patent is fully utilized, and meanwhile, some retrieval requirements of searching for a text through a graph, searching for a graph through a text and searching for a graph through a graph in the current market are met; and the patent retrieval result is more accurate.

Description

technical field [0001] The invention relates to the technical field of text retrieval, in particular to a patent text retrieval method and device based on multimodal matrix vector representation. Background technique [0002] Traditional text retrieval is completed through regularization matching. When the user uses synonyms or words with similar meanings to the key words in the document to search, the records will not be retrieved. Moreover, the algorithm based on the LDA topic model trains a large-scale document corpus in an unsupervised manner, so that the topic model of each document can be obtained, so that the retrieval based on the document topic can be completed. [0003] In recent years, semantic retrieval technology based on word vectors has emerged, which can be obtained through unsupervised training of massive texts. word2vec is powerful in capturing lexical relationships between words, but the resulting vectors are largely uninterpretable and difficult to chara...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F17/16G06F40/289G06F40/30G06K9/62G06N3/04G06N3/08
CPCG06F16/3344G06F40/289G06F40/30G06N3/08G06F17/16G06N3/044G06N3/045G06F18/22
Inventor 许林李一君郑倩蒋涛刘甲甲袁建英谢昱锐
Owner CHENGDU UNIV OF INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products