Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Vehicle type recognition method with deep network model based on spatial pyramid pooling

A space pyramid, deep network technology, applied in character and pattern recognition, instruments, computer parts, etc., can solve the problems of different sizes, image geometric deformation, damage to the scale and aspect ratio of the input image, and improve the accuracy. , the effect of improving the robustness

Inactive Publication Date: 2016-08-24
UNIV OF ELECTRONIC SCI & TECH OF CHINA
View PDF3 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the size of the input image of the traditional convolutional neural network architecture is fixed (for example: 256x256), and this artificially changing the size of the input image destroys the scale and aspect ratio of the input image, so there are problems: (1 ) The choice of scale is subjective. For different targets, the most suitable size may be different; (2) For images of different sizes and aspect ratios, forcing the transformation to a fixed size will lose information; (3) The forced transformed image may not contain a complete image, and may also cause geometric deformation of the image

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Vehicle type recognition method with deep network model based on spatial pyramid pooling
  • Vehicle type recognition method with deep network model based on spatial pyramid pooling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to describe the technical content, structural features, achieved goals and effects of the present invention in detail, the following will be described in detail in conjunction with the embodiments and accompanying drawings.

[0038] The invention proposes a vehicle type recognition method based on a spatial pyramid pooling deep network model, which achieves good results in vehicle type recognition. The schematic diagram of the whole algorithm is shown in figure 1 shown, including steps:

[0039] Step 1: Import the image of the vehicle model database into the deep network model for feature extraction of the convolutional layer to form a feature map of the convolutional layer;

[0040] The first layer of the deep network model is a convolutional layer consisting of 6 feature maps. Each neuron in the feature map is related to the input neighbors are connected. The size of the feature map is , which prevents incoming connections from falling out of bounds. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a vehicle type recognition method with a deep network model based on spatial pyramid pooling. The method comprises the steps that images in a vehicle type database are input into the deep network model for convolution layer characteristic extraction, so that a convolution layer characteristic diagram is constituted; spatial pyramid convolution computation is carried out to each image in the convolution layer characteristic diagram according to different scales, so that a characteristic diagram of spatial pyramid layers is constituted; all the characteristics of the spatial pyramid layers are pooled to constitute full-connected layers, so that a final characteristic representations of vehicle type images can be obtained; the characteristic representations off the vehicle type images are used in training of a linear support vector machine to obtain a vehicle type recognition system; and the characteristic representation of a to-be-recognized vehicle is also acquired and input into the recognition system, so that the vehicle type can be recognized. A traditional deep network model requires that an input image must have a fixed size, so that operations of large-scale vehicle type image data are limited. The method disclosed by the invention adopts the deep network model based on the spatial pyramid pooling, so that the problem is effectively solved; and the method has high practicability and robustness.

Description

technical field [0001] The invention belongs to the technical field of machine learning, pattern classification and recognition, and in particular relates to a car model recognition method based on a deep network model of spatial pyramid pooling. Background of the invention [0002] With the continuous improvement of living standards in modern society and the rapid growth of the number of cars, traffic supervision is facing great challenges. As an important means of traffic supervision, video surveillance system has been widely used in various fields of modern traffic. However, the traditional method of relying on manual interpretation can no longer meet the needs of today's massive traffic video processing, and it has become an inevitable trend to build an intelligent recognition system to automatically process various traffic video information. The identification of vehicle types in traffic video images, as a key technology in construction, has long been widely concerned ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06K9/46
CPCG06V10/40G06V2201/08G06F18/2411
Inventor 李鸿升胡欢曹滨范峻铭周辉
Owner UNIV OF ELECTRONIC SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products