Multi-modal image retrieval method and system for appearance patent

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A multi-modal image and image retrieval technology, applied in still image data retrieval, metadata still image retrieval, neural learning methods, etc., can solve the problems of low retrieval accuracy and low retrieval efficiency, achieve good retrieval effect, improve The effect of efficiency and accuracy

Active Publication Date: 2020-08-28

GUANGDONG UNIV OF TECH

View PDF4 Cites 6 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0008] The present invention provides a multimodal image retrieval method and system for appearance patents to solve the problems of low retrieval efficiency and low retrieval accuracy in the existing multimodal image retrieval method for appearance patents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0051] A multimodal image retrieval method for appearance patents, such as figure 1 with 2 shown, including the following steps:

[0052] S1. Extract image features and text features of multiple views of appearance patents;

[0053] Among them, for the multi-view image features of the appearance patent, this embodiment uses the ResNet-based improved deep convolutional neural network Res2Net proposed by ShangHua Gao et al. in the 2019 CVPR paper to extract: z 1 ,z 2 ,…,z n , where n represents the number of views of the appearance patent, which may include left view, right view, front view, rear view, top view, three-dimensional figure 1 and three-dimensional figure 2 etc.; carry out weighted fusion to described image feature, obtain the image feature of multi-view fusion:

[0054]

[0055] i represents the i-th view of the design patent, and β represents the weight of the i-th view of the design patent. It should be noted that the weight ratio of the perspective view...

Embodiment 2

[0088] This embodiment provides a multimodal image retrieval system for appearance patents, such as Figure 4 shown, including:

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a multi-modal image retrieval method and system for an appearance patent. Firstly, feature extraction and fusion are carried out on multiple views of an appearance patent, thenfeature extraction is carried out on a text, information of multiple modes is comprehensively considered, and finally deep visual semantic embedding is carried out, so that a good retrieval effect can be achieved in a large-scale appearance design patent database; for a tree structure in an ANN, compact coding representation is not performed on data so that efficiency is not high. Calculation ofthe Hamming distance in the hash method is not an accurate distance calculation problem. According to the invention, distance coding product quantization is provided, in the coding process, data points are coded into series connection of subspace clustering indexes, the distance between each data point and a reconstructed coded representation of the data point is coded, and an effective compact coded representation of each datum is formed; and therefore, the retrieval efficiency and accuracy are improved.

Description

technical field [0001] The invention relates to the technical field of image retrieval, in particular to a multimodal image retrieval method and system for appearance patents. Background technique [0002] Since images are the main content of design patents, the key technology for searching design patents is the core technology of image search. However, the design patent not only contains multiple view information of the patented design, but also contains related brief descriptions and other text information, such as the text description for the chair "This is a wooden rectangular dining table and chair with rounded corners", etc. Wait. Therefore, how to make good use of the text information of design patents for multimodal retrieval to optimize the retrieval effect is a problem of practical significance. [0003] In recent years, many scholars have invested in the multimodal learning technology because of its many modalities and rich information. However, how to integrat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06F16/55G06F16/58G06F16/583G06K9/62G06N3/04G06N3/08

CPCG06F16/583G06F16/5866G06F16/55G06N3/08G06N3/045G06F18/23Y02D10/00

Inventor叶街林杨志景谭俊鹏

OwnerGUANGDONG UNIV OF TECH

Multi-modal image retrieval method and system for appearance patent

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology