Zero sample image recognition method and recognition device thereof, medium and computer terminal

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A sample image and recognition method technology, applied in neural architecture, instruments, biological neural network models, etc., can solve the problems of incomplete category information of generated samples, inconsistent distribution of sample domains, etc., to speed up research and application, and increase class distinction , the effect of improving task performance

Pending Publication Date: 2022-07-29

ANHUI UNIVERSITY

View PDF1 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0007] In order to solve the technical problems of inconsistent sample domain distribution and incomplete generated sample category information in the generated model, the present invention provides a zero-sample image recognition method based on prototype domain alignment and cross-modal reconstruction, which is similar to the zero-sample image recognition method Corresponding zero-sample image recognition device, computer-readable storage medium and computer terminal using the zero-sample image recognition method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0112] The zero-sample image recognition method based on prototype domain alignment and cross-modal reconstruction of the present invention mainly has two stages: 1. a model training stage; 2. a zero-sample image recognition stage. Model training includes two parts: the training of the generative model and the training of the unseen class classifier. The first part is to obtain the trained feature generator. The second part first uses the trained feature generator and the semantic features of the unseen class to generate the unseen class. class visual features, and then train an unseen class classifier with these generated unseen class visual features. In the zero-sample image recognition stage, the unseen class image to be classified is sent to the trained unseen class classifier to identify the category to which the unseen class image belongs.

[0113] The zero-sample image recognition method can design a corresponding zero-sample image recognition device based on prototype ...

Embodiment 2

[0176] The zero-sample image recognition method in this embodiment is basically similar to that in Embodiment 1, and is used to recognize visual features of unseen classes. refer to Figure 4 , the method includes the following steps:

[0177] S1. Get the dataset.

[0178] S2. Extract visual features, semantic features and visual prototypes of visible class categories. Using the visual extractor and semantic extractor, the visual features and semantic features of the visible category images, and the unseen category semantic features are extracted. Design a prototype extractor to obtain visual prototypes for each visible class category.

[0179] S3. Use the visual features and semantic features of the visible category images and the visible category visual prototype to train the generative model to obtain a trained feature generator.

[0180] S4. Send the semantic features of the unseen category into the trained feature generator to obtain the generated visual features of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a zero sample image recognition method based on prototype domain alignment and cross-modal reconstruction. The method comprises the following main steps: extracting visual features of visible samples; obtaining visual distribution information of the visible images through a feature distribution encoder; obtaining generated visual features through a feature generator; training and optimizing parameters of the model of the feature generator by designing distribution regularization loss, domain consistency loss, visual reconstruction loss, adversarial loss and semantic reconstruction loss to obtain a trained feature generator; inputting the unseen class semantic features into a trained feature generator to obtain generated unseen class visual features; training an unseen class classifier by using the generated unseen class visual features; and utilizing the trained unseen class classifier to predict the unseen class image. According to the method, through prototype domain consistency alignment and cross-modal reconstruction, the generated visual features are closer to real visual features, and more category discrimination features are included.

Description

technical field [0001] The invention relates to a zero-sample image recognition method in the field of computer vision image recognition, in particular to a zero-sample image recognition method based on prototype domain alignment and cross-modal reconstruction, and a zero-sample image recognition method corresponding to the zero-sample image recognition method. A zero-sample image recognition device, a computer-readable storage medium and a computer terminal using the zero-sample image recognition method. Background technique [0002] Existing image recognition methods need to collect a large number of images with category labels to train the model in the model training stage, and the recognition stage can only identify the categories that have appeared in the training stage. However, in practical scenarios, it is often necessary to identify categories that lack images in the training phase, such as images of endangered species, medical tumor images, etc. In the above appli...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06V10/764G06V10/774G06N3/04

CPCG06V10/764G06V10/774G06N3/045

Inventor 赵鹏刘金辉韩莉

Owner ANHUI UNIVERSITY

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Zero sample image recognition method and recognition device thereof, medium and computer terminal

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology