Unlock instant, AI-driven research and patent intelligence for your innovation.

Scale-aware monocular localization and mapping

A scale and image technology, applied in the field of scale fuzzy, can solve the problems that cannot be generalized, difficult to identify, easy to fail, etc.

Pending Publication Date: 2021-11-26
HUAWEI TECH CO LTD
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This suffers from ill-posed problems, so the results are inaccurate and cannot be generalized to previously unseen scenarios
[0007] (2) Additional sensors, such as a second camera, active depth sensor, or inertial measurement unit (IMU), enable the system to directly recover scale-aware measurements, but this adds additional hardware and requires fusing information from different modalities
The method relies on knowledge about objects, which means that the method can easily fail when faced with unknown objects or known objects that do not match previously seen objects of that type, for example, if the method is familiar with trees such as apple trees, are known to be about 3 meters (10 feet) tall, but it is difficult to identify a sequoia that may be about 100 meters (350 feet) tall as a tree

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scale-aware monocular localization and mapping
  • Scale-aware monocular localization and mapping
  • Scale-aware monocular localization and mapping

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] figure 1 is a schematic diagram of monocular vision. Object 1 forms part of the captured image. In this case, Object 1 is a tree, and an outdoor scene is being captured. The digital video camera 2 has an aperture 4 , approximately a pinhole, through which light can enter in order to record an image 6 on a digital sensor 8 . The raw data recorded on the digital sensor can be processed to reconstruct the image in a human-viewable format, or to appear directly on the camera's screen, and / or so that it can be transmitted to a separate device for further processing and / or Check. As mentioned above, one difficulty with this processing is that inverse mapping the raw data back to an image is ambiguous with respect to the depth of the object (eg object 1), so the scale of the object may not be known.

[0040] To resolve scale ambiguity, instead of a single image, if camera 2 is moving while capturing multiple images as video frames, simultaneous positioning of the cameras a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Disclosed is an image processing device comprising a processor configured to estimate the scale of image features by the steps of: processing multiple images of a scene by means of a first trained model to identify features in the images and to estimate the depths of those features in the images; processing the multiple images by a second trained model to estimate a scaling for the images; and estimating the scales of the features by adjusting the estimated depths in dependence on the estimated scaling. A method for training an image processing model is also disclosed.

Description

technical field [0001] The present invention relates to solving the problem of scale ambiguity when projecting images captured by a digital video camera. Background technique [0002] Digital cameras capture images by light falling on a digital sensor that converts the collected light into digital data, known as raw data. The raw data is then processed to create a raster image formed as a field of pixels, where each pixel represents the brightness and color received at an area of ​​the digital sensor representing an area of ​​the captured image. The field of pixels can then be displayed on the screen so it can be seen by a human. In this way, projection of the captured image is achieved. Both raw and processed data can be stored and transferred to another device, allowing further reproduction of captured images as and when required. [0003] If the image is a scene (could be an outdoor scene or an indoor scene), the image usually includes many objects or features. In som...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06T7/55
CPCG06T7/55G06T2207/10016G06T2207/10021G06T2207/20084G06T2207/20081G06T7/73G06T3/40G06T7/60G06T3/18
Inventor 本杰明·布萨姆格雷戈里·斯拉堡
Owner HUAWEI TECH CO LTD