Method for generating vision dictionary set by combining different clustering algorithms

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A clustering algorithm and visual dictionary technology, applied in the field of image classification based on visual dictionary, can solve the problems of excessive supervision, complex model and poor robustness, and achieve the effect of low supervision, simple model and low requirements.

Inactive Publication Date: 2015-04-08

JIANGXI UNIV OF SCI & TECH

View PDF2 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] In order to solve the problems of too complex models, too strong supervision and poor robustness in traditional object recognition, the present invention provides a method of combining different clustering algorithms to generate a collective visual dictionary, using visual dictionaries to parallelly utilize existing A variety of information to identify objects

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Examples

Experimental program

Comparison scheme

Effect test

specific Embodiment

[0024] The Harris-Laplace salient region detector is used to detect the salient region of the image, and the C-SFIT descriptor is used to describe the salient region, and the size of the member visual dictionary is set to 2000. In order to improve the performance of members, a spatial pyramid structure 1x1+2x2+1x3 is used. A descriptor corresponds to its nearest word in Euler space. After forming a member visual dictionary, in order to quantify the image, all detected salient regions are used to build a histogram based on this member visual dictionary. To make the histogram independent of the number of descriptors, the histogram vector is normalized to sum to 1. The visual dictionary is obtained by applying a clustering algorithm to a set of 200,000 descriptors randomly selected from the training image set. Weighted LibSVM is used to train the classifier. In the training phase, the weight of positive samples is set to , and the weight of the negative sample is set to , w...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method for generating vision dictionary set and relates to the technical fields of mode recognition, computer vision and image understanding. In order to achieve the purpose of capturing different data structures of natural objects and detecting clusters in different shapes and sizes, a framework is required for synthesizing the outputs of a plurality of clustering algorithms, and different clustering algorithms are conducted on a local vision describe subset of a same training image set so as to obtain a vision dictionary set. Different quantized vectors of the same image are obtained on the basis of the vision dictionary set. A classifier set is obtained by learning on different express vector sets of the same training image set. The construction of the vision dictionary set is non-supervised type and a member vision dictionary is independently and concurrently constructed by using different clustering algorithms. A test result shows that the method provided by the invention is capable of obviously increasing the performance of a single vision dictionary, has robustness on background noise and is excellent in recognizing effect.

Description

technical field [0001] The invention belongs to the technical fields of pattern recognition, computer vision and image understanding, and in particular relates to an image classification method based on a visual dictionary. Background technique [0002] The current popular method for image classification is the "bag-of-words" model. Although the "bag-of-words" model is not explicitly shape-modelled, the learned model is effective for irregularly shaped objects or highly structured object classes. After detecting independent salient region blocks and calculating descriptors (that is, feature representations) for these independent blocks, a visual dictionary is obtained by clustering the descriptors of a specific training image set, and then the image is quantized based on the visual dictionary and input into the traditional The classifier gets the classification result. The degree of learning supervision of current image classification methods is generally relatively strong...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G06K9/62

Inventor罗会兰刘发升胡春安

OwnerJIANGXI UNIV OF SCI & TECH

Method for generating vision dictionary set by combining different clustering algorithms

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Examples

specific Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology