Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

RGB-D multi-mode fusion person detection method based on asymmetric double-flow network

A technology of RGB-D and personnel detection, applied in biological neural network models, character and pattern recognition, instruments, etc., can solve the problems of unable to detect end-to-end pedestrians, difficult to fully and effectively mine, large storage space, etc., to reduce parameters, enhanced robustness, and improved detection accuracy

Pending Publication Date: 2020-04-03
BEIJING UNIV OF TECH
View PDF3 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] However, this method needs to manually extract the gradient direction histogram of the traditional RGBD image as an image feature, which is time-consuming and laborious and takes up a large storage space, and cannot realize end-to-end pedestrian detection; the gradient direction histogram feature is relatively simple, and it is difficult to extract RGB and depth Discriminatory features in the image are used for pedestrian detection; this method uses a simple fusion of RGB and depth image features, and it is difficult to fully and effectively mine the fine-grained information such as color and texture of the RGB image and the semantic information such as edge and depth provided by the depth image. , to realize the association and complementarity between multi-modal data, which has great limitations in improving the accuracy of pedestrian detection

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • RGB-D multi-mode fusion person detection method based on asymmetric double-flow network
  • RGB-D multi-mode fusion person detection method based on asymmetric double-flow network
  • RGB-D multi-mode fusion person detection method based on asymmetric double-flow network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Apparently, the described embodiments are some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention. The present invention will be described in detail below through specific examples.

[0031] The schematic diagram of the method provided by the embodiment of the present invention is as follows image 3 shown, including the following steps:

[0032] S1: Use a camera capable of shooting RGB images and depth images at the same time to obtain the original RGB ima...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an RGB-D multi-modal fusion person detection method based on an asymmetric double-flow network, and belongs to the field of computer vision and image processing. The method comprises the steps of RGBD image acquisition, depth image preprocessing, RGB feature extraction and Depth feature extraction, RGB multi-scale fusion and Depth multi-scale fusion, multi-modal feature channel reweighting and multi-scale personnel prediction. According to the method, an asymmetric RGBD double-flow convolutional neural network model is designed to solve the problem that a traditional symmetric RGBD double-flow network is prone to causing depth feature loss. Multi-scale fusion structures are designed for RGBD double-flow networks respectively, so multi-scale information complementation is achieved. A multi-modal reweighting structure is constructed, the RGB and Depth feature maps are combined, and weighted assignment is performed on each combined feature channel to realize modelautomatic learning contribution proportion. Person classification and frame regression are carried out by using multi-modal features, so the accuracy of personnel detection is improved while the real-time performance is ensured, and the robustness of detection under low illumination at night and personnel shielding is enhanced.

Description

technical field [0001] The invention belongs to the field of computer vision and image processing, and in particular relates to an RGB-D multi-modal fusion personnel detection method based on an asymmetric dual-stream network. Background technique [0002] In recent years, the fields of smart home, smart building, and smart security have developed rapidly. The wide application of video extraction and analysis technology has become a key driving force for its progress. Among them, the detection and statistics of people have gradually become the core of image video analysis and artificial intelligence. A hot research topic. In terms of smart home, by detecting the indoor personnel, the position of the person can be located, the behavior habits of the personnel can be recorded, and the intelligent equipment such as indoor lighting and air conditioning can be further adjusted to provide people with a more comfortable and intelligent home environment. In terms of smart buildings...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/40G06K9/46G06N3/04
CPCG06V40/10G06V10/30G06V10/56G06N3/045
Inventor 张文利郭向杨堃王佳琪
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products