Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for generating vision dictionary set by combining different clustering algorithms

A clustering algorithm and visual dictionary technology, applied in the field of image classification based on visual dictionaries, can solve problems such as excessive supervision, complex models, and poor robustness, and achieve the effects of low supervision, simple models, and low requirements

Inactive Publication Date: 2012-07-25
JIANGXI UNIV OF SCI & TECH
View PDF2 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the problems of too complex models, too strong supervision and poor robustness in traditional object recognition, the present invention provides a method of combining different clustering algorithms to generate a collective visual dictionary, using visual dictionaries to parallelly utilize existing A variety of information to identify objects

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0024] The Harris-Laplace salient region detector is used to detect the salient region of the image, and the C-SFIT descriptor is used to describe the salient region, and the size of the member visual dictionary is set to 2000. In order to improve the performance of members, a spatial pyramid structure 1x1+2x2+1x3 is used. A descriptor corresponds to its nearest word in Euler space. After forming a member visual dictionary, in order to quantify the image, all detected salient regions are used to build a histogram based on this member visual dictionary. To make the histogram independent of the number of descriptors, the histogram vector is normalized to sum to 1. The visual dictionary is obtained by applying a clustering algorithm to a set of 200,000 descriptors randomly selected from the training image set. Weighted LibSVM is used to train the classifier. In the training phase, the weight of positive samples is set to , and the weight of the negative sample is set to , w...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for generating vision dictionary set and relates to the technical fields of mode recognition, computer vision and image understanding. In order to achieve the purpose of capturing different data structures of natural objects and detecting clusters in different shapes and sizes, a framework is required for synthesizing the outputs of a plurality of clustering algorithms, and different clustering algorithms are conducted on a local vision describe subset of a same training image set so as to obtain a vision dictionary set. Different quantized vectors of the same image are obtained on the basis of the vision dictionary set. A classifier set is obtained by learning on different express vector sets of the same training image set. The construction of the vision dictionary set is non-supervised type and a member vision dictionary is independently and concurrently constructed by using different clustering algorithms. A test result shows that the method provided by the invention is capable of obviously increasing the performance of a single vision dictionary, has robustness on background noise and is excellent in recognizing effect.

Description

technical field [0001] The invention belongs to the technical fields of pattern recognition, computer vision and image understanding, and in particular relates to an image classification method based on a visual dictionary. Background technique [0002] The current popular method for image classification is the "bag-of-words" model. Although the "bag-of-words" model is not explicitly shape-modelled, the learned model is effective for irregularly shaped objects or highly structured object classes. After detecting independent salient region blocks and calculating descriptors (that is, feature representations) for these independent blocks, a visual dictionary is obtained by clustering the descriptors of a specific training image set, and then the image is quantized based on the visual dictionary and input into the traditional The classifier gets the classification result. The degree of learning supervision of current image classification methods is generally relatively strong...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
Inventor 罗会兰刘发升胡春安
Owner JIANGXI UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products