Image zero-order classification model based on cross knowledge and classification method thereof

A sub-classification and image technology, applied in computer parts, character and pattern recognition, instruments, etc., can solve the problems of difficulty in discriminating models, discriminating unseen classes, reducing model performance, etc., to alleviate adverse effects, model and The method is simple and efficient, and the effect of increasing the intersection point

Active Publication Date: 2021-07-30
YUNNAN UNIV
View PDF12 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1. Cross-modality problem: The cross-modality problem in visual-semantic embedding leads to incomplete representation of semantic features and visual features during embedding, especially in two categories that are very similar and have no difference in embedding space , which makes it difficult for the model to discriminate and greatly reduces the performance o

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image zero-order classification model based on cross knowledge and classification method thereof
  • Image zero-order classification model based on cross knowledge and classification method thereof
  • Image zero-order classification model based on cross knowledge and classification method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0089] A method for training an image zero-order classification model based on cross-knowledge, comprising the following steps:

[0090] Step S1: Construct a biological taxonomy tree according to all category names in the dataset;

[0091] Step S2: Input the image, text or attribute description in the data set into the visual feature extraction module and the semantic feature extraction module respectively, and extract the visual vector and semantic vector;

[0092] Step S3: According to the biological taxonomy tree and the visual vector, construct visual feature datasets at the Family level, Genus level, and Species level;

[0093] Step S4: According to the biological taxonomy tree and semantic vectors, construct semantic feature data sets at the Family level, Genus level, and Species level;

[0094] Step S5: Initialize the discriminator and the generator with classification regularization;

[0095] Step S6: Cross-select semantic features and visual features from the visual...

Embodiment 2

[0102] A method for training an image zero-order classification model based on cross-knowledge, comprising the following steps:

[0103] Step S1: Construct a biological taxonomy tree according to all category names in the dataset;

[0104] Step S2: Input the image, text or attribute description in the data set into the visual feature extraction module and the semantic feature extraction module respectively, and extract the visual vector and semantic vector;

[0105] Step S3: According to the biological taxonomy tree and the visual vector, construct visual feature datasets at the Family level, Genus level, and Species level;

[0106] Step S4: According to the biological taxonomy tree and semantic vectors, construct semantic feature data sets at the Family level, Genus level, and Species level;

[0107] Step S5: Initialize the discriminator and the generator with classification regularization;

[0108] Step S6: Cross-select semantic features and visual features from the visual...

Embodiment 3

[0115] A method for training an image zero-order classification model based on cross-knowledge, comprising the following steps:

[0116] Step S1: Construct a biological taxonomy tree according to all category names in the dataset;

[0117] Step S2: Input the image, text or attribute description in the data set into the visual feature extraction module and the semantic feature extraction module respectively, and extract the visual vector and semantic vector;

[0118] Step S3: According to the biological taxonomy tree and the visual vector, construct visual feature datasets at the Family level, Genus level, and Species level;

[0119] Step S4: According to the biological taxonomy tree and semantic vectors, construct semantic feature data sets at the Family level, Genus level, and Species level;

[0120] Step S5: Initialize the discriminator and the generator with classification regularization;

[0121] Step S6: Cross-select semantic features and visual features from the visual...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an image zero-order classification model based on cross knowledge. The image zero-order classification model comprises: a biological classification tree module, which is used for constructing a biological classification tree according to all categories in the data set; the visual feature extraction module that is used for converting images in the data set into one-dimensional visual features; the semantic feature extraction module that is used for converting texts or attributes in the data set into one-dimensional semantic features; the cross knowledge learning module that is used for enriching semantic information of categories; the generative adversarial network module that comprises a generator and a discriminator, the generator generates pseudo visual features from the semantic features, and the discriminator is used for discriminating the authenticity and the category of the image. According to the invention, cross knowledge learning is adopted, more related semantic features can be trained, so that features from semantics to vision are embedded in the ZSL, and the semantic features in the cross-modal learning process are enriched; the model and the method are simple and efficient, and high-accuracy classification results are obtained on multiple authoritative data sets.

Description

technical field [0001] The invention relates to the technical field of image classification, in particular to an image zero-order classification model and a classification method based on cross-knowledge. Background technique [0002] Due to the rapid expansion of data scale and the vigorous development of machine learning models, the field of image classification has become more and more attractive. However, collecting sufficient data sets is time-consuming and laborious, and some data sets are not available. How to correctly and efficiently classify some categories in the absence of some data sets has become one of the main challenges in the field of image classification. [0003] To solve the problem of imperfect data sets, the current mainstream solutions in the field first proposed the concept of Zero-Shot Learning (ZSL). It can identify new categories that have not appeared in the training phase during the test phase, that is, it is used to solve the situation where t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06K9/46
CPCG06V10/462G06F18/22G06F18/24G06F18/214
Inventor 曾婷向鸿鑫谢诚
Owner YUNNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products