Zero sample image recognition method and recognition device thereof, medium and computer terminal

A sample image and recognition method technology, applied in neural architecture, instruments, biological neural network models, etc., can solve the problems of incomplete category information of generated samples, inconsistent distribution of sample domains, etc., to speed up research and application, and increase class distinction , the effect of improving task performance

Pending Publication Date: 2022-07-29
ANHUI UNIVERSITY
View PDF1 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In order to solve the technical problems of inconsistent sample domain distribution and incomplete generated sample category information in the generated model, the present invention provides a zero-sample image recognition method based on prototype domain alignment and cross-modal reconstruction, which is similar to the zero-sample image recognition method Corresponding zero-sample image recognition device, computer-readable storage medium and computer terminal using the zero-sample image recognition method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Zero sample image recognition method and recognition device thereof, medium and computer terminal
  • Zero sample image recognition method and recognition device thereof, medium and computer terminal
  • Zero sample image recognition method and recognition device thereof, medium and computer terminal

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0112] The zero-sample image recognition method based on prototype domain alignment and cross-modal reconstruction of the present invention mainly has two stages: 1. a model training stage; 2. a zero-sample image recognition stage. Model training includes two parts: the training of the generative model and the training of the unseen class classifier. The first part is to obtain the trained feature generator. The second part first uses the trained feature generator and the semantic features of the unseen class to generate the unseen class. class visual features, and then train an unseen class classifier with these generated unseen class visual features. In the zero-sample image recognition stage, the unseen class image to be classified is sent to the trained unseen class classifier to identify the category to which the unseen class image belongs.

[0113] The zero-sample image recognition method can design a corresponding zero-sample image recognition device based on prototype ...

Embodiment 2

[0176] The zero-sample image recognition method in this embodiment is basically similar to that in Embodiment 1, and is used to recognize visual features of unseen classes. refer to Figure 4 , the method includes the following steps:

[0177] S1. Get the dataset.

[0178] S2. Extract visual features, semantic features and visual prototypes of visible class categories. Using the visual extractor and semantic extractor, the visual features and semantic features of the visible category images, and the unseen category semantic features are extracted. Design a prototype extractor to obtain visual prototypes for each visible class category.

[0179] S3. Use the visual features and semantic features of the visible category images and the visible category visual prototype to train the generative model to obtain a trained feature generator.

[0180] S4. Send the semantic features of the unseen category into the trained feature generator to obtain the generated visual features of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a zero sample image recognition method based on prototype domain alignment and cross-modal reconstruction. The method comprises the following main steps: extracting visual features of visible samples; obtaining visual distribution information of the visible images through a feature distribution encoder; obtaining generated visual features through a feature generator; training and optimizing parameters of the model of the feature generator by designing distribution regularization loss, domain consistency loss, visual reconstruction loss, adversarial loss and semantic reconstruction loss to obtain a trained feature generator; inputting the unseen class semantic features into a trained feature generator to obtain generated unseen class visual features; training an unseen class classifier by using the generated unseen class visual features; and utilizing the trained unseen class classifier to predict the unseen class image. According to the method, through prototype domain consistency alignment and cross-modal reconstruction, the generated visual features are closer to real visual features, and more category discrimination features are included.

Description

technical field [0001] The invention relates to a zero-sample image recognition method in the field of computer vision image recognition, in particular to a zero-sample image recognition method based on prototype domain alignment and cross-modal reconstruction, and a zero-sample image recognition method corresponding to the zero-sample image recognition method. A zero-sample image recognition device, a computer-readable storage medium and a computer terminal using the zero-sample image recognition method. Background technique [0002] Existing image recognition methods need to collect a large number of images with category labels to train the model in the model training stage, and the recognition stage can only identify the categories that have appeared in the training stage. However, in practical scenarios, it is often necessary to identify categories that lack images in the training phase, such as images of endangered species, medical tumor images, etc. In the above appli...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06V10/764G06V10/774G06N3/04
CPCG06V10/764G06V10/774G06N3/045
Inventor 赵鹏刘金辉韩莉
Owner ANHUI UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products