Image zero-order classification model based on cross knowledge and classification method thereof

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A sub-classification and image technology, applied in computer parts, character and pattern recognition, instruments, etc., can solve the problems of difficulty in discriminating models, discriminating unseen classes, reducing model performance, etc., to alleviate adverse effects, model and The method is simple and efficient, and the effect of increasing the intersection point

Active Publication Date: 2021-07-30

YUNNAN UNIV

View PDF12 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] 1. Cross-modality problem: The cross-modality problem in visual-semantic embedding leads to incomplete representation of semantic features and visual features during embedding, especially in two categories that are very similar and have no difference in embedding space , which makes it difficult for the model to discriminate and greatly reduces the performance of the model;

[0006] 2. Cross-domain problems: seen classes may rarely (or not) intersect with unseen classes, and the visual appearance of the same attribute or textual description may be significantly different in invisible classes, which will lead to It is difficult for the model to accurately distinguish unseen classes when embedding semantic vectors into visual space

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0089] A method for training an image zero-order classification model based on cross-knowledge, comprising the following steps:

[0090] Step S1: Construct a biological taxonomy tree according to all category names in the dataset;

[0091] Step S2: Input the image, text or attribute description in the data set into the visual feature extraction module and the semantic feature extraction module respectively, and extract the visual vector and semantic vector;

[0092] Step S3: According to the biological taxonomy tree and the visual vector, construct visual feature datasets at the Family level, Genus level, and Species level;

[0093] Step S4: According to the biological taxonomy tree and semantic vectors, construct semantic feature data sets at the Family level, Genus level, and Species level;

[0094] Step S5: Initialize the discriminator and the generator with classification regularization;

[0095] Step S6: Cross-select semantic features and visual features from the visual...

Embodiment 2

[0102] A method for training an image zero-order classification model based on cross-knowledge, comprising the following steps:

[0103] Step S1: Construct a biological taxonomy tree according to all category names in the dataset;

[0104] Step S2: Input the image, text or attribute description in the data set into the visual feature extraction module and the semantic feature extraction module respectively, and extract the visual vector and semantic vector;

[0105] Step S3: According to the biological taxonomy tree and the visual vector, construct visual feature datasets at the Family level, Genus level, and Species level;

[0106] Step S4: According to the biological taxonomy tree and semantic vectors, construct semantic feature data sets at the Family level, Genus level, and Species level;

[0107] Step S5: Initialize the discriminator and the generator with classification regularization;

[0108] Step S6: Cross-select semantic features and visual features from the visual...

Embodiment 3

[0115] A method for training an image zero-order classification model based on cross-knowledge, comprising the following steps:

[0116] Step S1: Construct a biological taxonomy tree according to all category names in the dataset;

[0117] Step S2: Input the image, text or attribute description in the data set into the visual feature extraction module and the semantic feature extraction module respectively, and extract the visual vector and semantic vector;

[0118] Step S3: According to the biological taxonomy tree and the visual vector, construct visual feature datasets at the Family level, Genus level, and Species level;

[0119] Step S4: According to the biological taxonomy tree and semantic vectors, construct semantic feature data sets at the Family level, Genus level, and Species level;

[0120] Step S5: Initialize the discriminator and the generator with classification regularization;

[0121] Step S6: Cross-select semantic features and visual features from the visual...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an image zero-order classification model based on cross knowledge. The image zero-order classification model comprises: a biological classification tree module, which is used for constructing a biological classification tree according to all categories in the data set; the visual feature extraction module that is used for converting images in the data set into one-dimensional visual features; the semantic feature extraction module that is used for converting texts or attributes in the data set into one-dimensional semantic features; the cross knowledge learning module that is used for enriching semantic information of categories; the generative adversarial network module that comprises a generator and a discriminator, the generator generates pseudo visual features from the semantic features, and the discriminator is used for discriminating the authenticity and the category of the image. According to the invention, cross knowledge learning is adopted, more related semantic features can be trained, so that features from semantics to vision are embedded in the ZSL, and the semantic features in the cross-modal learning process are enriched; the model and the method are simple and efficient, and high-accuracy classification results are obtained on multiple authoritative data sets.

Description

technical field [0001] The invention relates to the technical field of image classification, in particular to an image zero-order classification model and a classification method based on cross-knowledge. Background technique [0002] Due to the rapid expansion of data scale and the vigorous development of machine learning models, the field of image classification has become more and more attractive. However, collecting sufficient data sets is time-consuming and laborious, and some data sets are not available. How to correctly and efficiently classify some categories in the absence of some data sets has become one of the main challenges in the field of image classification. [0003] To solve the problem of imperfect data sets, the current mainstream solutions in the field first proposed the concept of Zero-Shot Learning (ZSL). It can identify new categories that have not appeared in the training phase during the test phase, that is, it is used to solve the situation where t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/62G06K9/46

CPCG06V10/462G06F18/22G06F18/24G06F18/214

Inventor 曾婷向鸿鑫谢诚

Owner YUNNAN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Image zero-order classification model based on cross knowledge and classification method thereof

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology