Image feature representation method based on multi-semantic codebook

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An image feature and multi-semantic technology, applied in the field of computer vision of signal processing, can solve problems such as rough correspondence and inconformity with the real spatial distribution relationship, and achieve reduced redundancy and storage requirements, strong distinguishing ability, and strong distinguishing Effect

Active Publication Date: 2019-02-22

SHANGHAI JIAOTONG UNIV

View PDF3 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the corresponding relationship obtained by artificially dividing blocks is too rough and does not conform to the real spatial distribution relationship of each element in the image.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0033] The present invention will be described in detail below in conjunction with specific embodiments. The following examples will help those skilled in the art to further understand the present invention, but do not limit the present invention in any form. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention. These all belong to the protection scope of the present invention.

[0034] The image representation method based on the multi-task semantic codebook of the present invention uses the technical theory of multi-task learning to jointly train multiple semantic codebooks to encode and quantify the local features of the image, and designs an image descriptor based on semantic context Represents the visual features of the entire image. Based on the local image features extracted from different semantic types in the image, a set of dense semantic codebooks are trained, a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention relates to an image feature representation method based on a multi-semantic codebook. The method performs the following processing on the images in the input training set: Step 1: intensively calculate image local features on the input image, and combine all local features According to the given semantic annotations, it is divided into several categories; the second step: according to the local features of multiple semantic categories in the first step, an optimization problem of joint learning is established, and a global codebook and multiple semantic codebooks are obtained by solving; the third step: Use the local features of each semantic category to train the corresponding semantic classifier for each semantic category; the fourth step: use the global codebook, semantic codebook, and semantic classifier to perform context-based feature quantization and semantic aggregation on the image, and finally represent into an image feature vector, that is, an image representation. Experiments prove that this method can represent the visual features of images more finely, and has higher accuracy in scene recognition than traditional methods.

Description

technical field [0001] The invention relates to a method in the technical field of computer vision for signal processing, in particular to an image feature representation method based on a multi-semantic codebook. Background technique [0002] The basic framework of the traditional image classification algorithm based on the Bag-of-Words Model mainly includes four parts: (1) feature extraction; (2) feature quantification; (3) feature aggregation; (4) image classification . The first step of feature extraction intensively calculates a large number of local features at various positions and scales of the image. Commonly used local image features include SIFT, HOG, LBP, etc. The second step of feature quantization is to quantize each feature into a discrete value according to a given codebook, which is generally the sequence number of the codeword closest to the feature vector in the codebook. The codebook can be obtained through sample clustering, and commonly used methods ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06K9/62

CPCG06F18/2148G06F18/24147

Inventor 熊红凯王博韬

Owner SHANGHAI JIAOTONG UNIV

Image feature representation method based on multi-semantic codebook

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology