Multi-target unmarked attitude estimation method based on deep convolutional neural network

A pose estimation and neural network technology applied in the field of computer vision. It addresses problems such as difficulty of use for personnel, camera distortion, and uneven illumination, and achieves the effects of little loss of prediction accuracy, improved accuracy, and good robustness.

Active Publication Date: 2019-09-06
NANJING UNIV OF SCI & TECH

AI Technical Summary

Problems solved by technology

However, research on pose estimation faces many challenges: the number of targets in an image is usually not known in advance, and targets easily occlude each other when multiple targets are present.
Moreover, as the number of targets increases, the running efficiency of the algorithm also faces significant challenges.
Before deep learning was widely adopted, the graph-structure (Pictorial Structures) model was widely used in pose estimation, but results based on this model were not very accurate, and it was difficult to extend to multi-target pose estimation.
[0003] The emergence of deep learning algorithms has provided a new direction for multi-target pose estimation. At present, there are two main approaches to multi-target pose estimation. One is ...

Examples

Embodiment

[0054] The multi-target unmarked attitude estimation method based on a deep convolutional neural network of the present invention comprises the following steps:

[0055] 1. Acquire image sequences containing target behaviors from videos. Specifically:

[0056] Using K-means clustering on visual appearance, collect image sequences of the behavior of the target of interest under different brightness and background conditions, and ensure that the training data set contains a sufficient number of images (100-200). The image sequence collected in this embodiment is shown in Figure 2.
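
The patent text does not include an implementation for this selection step; the following is a minimal sketch, assuming K-means is run on downsampled grayscale thumbnails of the video frames and one representative frame is kept per cluster. The frame count, thumbnail size, and library choices (OpenCV, scikit-learn) are illustrative assumptions rather than details from the patent.

    import cv2
    import numpy as np
    from sklearn.cluster import KMeans

    def select_diverse_frames(video_path, n_clusters=150, thumb_size=(32, 32)):
        """Cluster video frames by visual appearance and keep one frame per cluster,
        so the collected images cover varied brightness and background conditions."""
        cap = cv2.VideoCapture(video_path)
        frames, features = [], []
        ok, frame = cap.read()
        while ok:
            frames.append(frame)
            # Downsampled grayscale thumbnail as a simple appearance descriptor.
            thumb = cv2.resize(cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY), thumb_size)
            features.append(thumb.flatten().astype(np.float32))
            ok, frame = cap.read()
        cap.release()
        if not frames:
            return []

        X = np.stack(features)
        k = min(n_clusters, len(frames))
        km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
        selected = []
        for c in range(k):
            members = np.where(km.labels_ == c)[0]
            # Keep the member closest to the cluster centre as the representative frame.
            best = members[np.argmin(np.linalg.norm(X[members] - km.cluster_centers_[c], axis=1))]
            selected.append(frames[best])
        return selected

With n_clusters set between 100 and 200, the selected frames match the data-set size suggested in paragraph [0056].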

[0057] 2. For each image in the collected image sequence, manually mark the position and category of each target feature part in the same order, and construct a training data set and a test data set according to the marked image sequence. Specifically:

[0058] 70% of the image sequences are randomly selected as the training image set, and the remaining ...
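
A minimal sketch of the random 70% / 30% split follows, assuming each annotated image is stored as an image path together with a list of (x, y, part-category) labels; that data layout is an assumption made for illustration, not the patent's annotation format.

    import random

    def split_dataset(annotated_images, train_fraction=0.7, seed=0):
        """annotated_images: list of (image_path, [(x, y, part_category), ...]) tuples.
        Returns (training set, test set) after a reproducible random shuffle."""
        items = list(annotated_images)
        random.Random(seed).shuffle(items)
        n_train = int(round(train_fraction * len(items)))
        return items[:n_train], items[n_train:]

    # Example usage:
    # train_set, test_set = split_dataset(labelled_frames)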



Abstract

The invention discloses a multi-target unmarked attitude estimation method based on a deep convolutional neural network. The method comprises the following steps: acquiring an image sequence containing target behavior from a video; for each image, marking the position and category of each target feature part, and constructing a training data set and a test data set; constructing a multi-target attitude estimation model; training the multi-target attitude estimation model to adjust its weights and optimize it; and carrying out precision evaluation on the model, then performing target attitude estimation or retraining according to the relation between the precision and an expected value, so as to further optimize the model. With this method, accuracy can reach the human level with only a small number of training images; target feature parts can be tracked effectively under cluttered backgrounds, non-uniform illumination, camera distortion and the like; and good results are achieved for different types of targets.
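
As a rough illustration of the train / evaluate / retrain loop described in the abstract, the sketch below alternates weight updates with precision evaluation until an expected accuracy value is met; the callable interfaces and the fixed round limit are schematic assumptions, not the patent's implementation.

    from typing import Callable, Tuple

    def optimise_until_accurate(train_one_round: Callable[[], None],
                                evaluate_precision: Callable[[], float],
                                expected_value: float,
                                max_rounds: int = 10) -> Tuple[bool, float]:
        """Alternate training and precision evaluation; stop once the measured
        precision meets the expected value (model ready for attitude estimation),
        otherwise report that further retraining is needed."""
        precision = 0.0
        for _ in range(max_rounds):
            train_one_round()                 # adjust the model weights
            precision = evaluate_precision()  # precision evaluation on the test set
            if precision >= expected_value:
                return True, precision
        return False, precision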

Description

technical field

[0001] The invention belongs to the field of computer vision, and in particular relates to a multi-target unmarked pose estimation method based on a deep convolutional neural network.

Background technique

[0002] The problem of multi-target pose estimation based on computer vision refers to estimating the position of each part of a target and the associated information from image features. It has a wide range of applications and strong practical value in fields such as human-computer interaction, video surveillance, virtual reality, and automatic driving. However, research on pose estimation faces many challenges: the number of targets in an image cannot be determined in advance, and targets easily occlude each other when multiple targets are present. Moreover, as the number of targets increases, the running efficiency of the algorithm also faces significant challenges. Before deep learning was widely adopted, the graph-structure (Pictorial Structures) model was widely used in pose estimation, but results based on this model were not very accurate, and it was difficult to extend to multi-target pose estimation ...


Application Information

IPC(8): G06K9/00, G06K9/62, G06N3/04, G06N3/08
CPC: G06N3/084, G06V40/20, G06V20/40, G06V2201/07, G06N3/045, G06F18/214, G06F18/241, Y02T10/40
Inventor: 白宏阳周育新李政茂郑浦徐啸康郭宏伟梁华驹
Owner: NANJING UNIV OF SCI & TECH