Unlock instant, AI-driven research and patent intelligence for your innovation.

Multi-label image preprocessing method for gesture recognition based on deep learning

An image preprocessing and gesture recognition technology, applied in character and pattern recognition, instruments, computing, etc., can solve the problems of decreased accuracy, increased image redundancy, deformation of target objects, etc., to improve accuracy and reduce complexity , the effect of reducing the number of channels

Active Publication Date: 2021-05-25
HANGZHOU DIANZI UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But the consequence of this is that the former compresses the aspect ratio of the original image, causing the detected target object in the image to deform, losing texture features, and reducing the accuracy rate; although the latter retains texture features, it will add many of the same The channel of the pixel value, which does not belong to the information in the image, increases the redundancy of the image, so it will also have a great impact on the accuracy
[0003] At the same time, when the proportion of the target in the image is small and the background is complex, there is too much useless information in the picture, which is not conducive to the extraction of target features by the neural network. The background is redundant and noisy, and the neural network will It is not easy to converge during training, the training time is longer, it will burden the server GPU, and the accuracy of the model will also be affected

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-label image preprocessing method for gesture recognition based on deep learning
  • Multi-label image preprocessing method for gesture recognition based on deep learning
  • Multi-label image preprocessing method for gesture recognition based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The present invention will be further described below in conjunction with accompanying drawing.

[0027] Such as Figure 1-5 Shown, in order to realize above-mentioned object, the technical scheme that the present invention adopts is as follows:

[0028] Step 1. Obtain the coordinate points (x 1 ,y 1 ),(x 2 ,y 2 )...(x 21 ,y 21 ), the coordinate points of the multiple feature points are also labels during network learning.

[0029] Step 2. Select the maximum value x of x in the image plane coordinate system from the coordinate points max and minimum x min , the maximum value of y y max and minimum y min , confirm that the coordinate point A(x min ,y min ) and coordinate point B(x max ,y max );

[0030] x max =max[x 1 ,x 2 ...x 21 ];

[0031] x min =min[x 1 ,x 2 ...x 21 ];

[0032] the y max =max[y 1 ,y 2 ...y 21 ];

[0033] the y min =min[y 1 ,y 2 ...y 21 ];

[0034] Step 3. If figure 1 As shown, using the coordinate point A(x min ,y...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-label image preprocessing method for gesture recognition based on deep learning. The steps of the present invention are as follows: 1. obtain the coordinates of a plurality of marked feature points; 2. select the maximum and minimum values ​​of x in the image plane coordinate system in the coordinates, the maximum and minimum values ​​of y, and confirm the coordinate point A (x min ,y min ) and coordinate point B(x max ,y max ); 3. Use the coordinates A and B to locate the square area P where the target label is valid in the image, and at the same time, a margin should be left at the coordinate points on the edge of the area P according to the rules, so as to obtain the expanded square area P 1 , and for the region P 1 The length and width of the update; 4. Compare the area P 1 length and width to get a new square area P 2 ; 5. Calculate the coordinates of the feature points in the clipping image as labels of the clipping image. The present invention cuts out a square image containing a target from an original image, adds as few channels as possible, reduces background redundancy, and retains target features.

Description

technical field [0001] The invention relates to an image preprocessing method for gesture recognition based on deep learning, which is suitable for feature points with complex backgrounds and multiple labels as two-dimensional coordinates of the recognized target. Background technique [0002] In order to make the loss value of the neural network converge better and faster, and to obtain a model with an excellent recognition rate, the trained 3D images will undergo a series of preprocessing before being input into the network. At present, no matter whether researchers use deep learning for classification tasks or regression tasks, they will directly scale the length and width of the original image of the dataset to the same size, or add 0 channels on the short side to ensure that the original image’s aspect ratio remains unchanged. Make the length and width consistent, which is the normalization of the scale. But the consequence of this is that the former compresses the asp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/00
CPCG06V40/113
Inventor 颜成钢吕晓泉张勇东
Owner HANGZHOU DIANZI UNIV