Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

479 results about "Image prediction" patented technology

Data encoding and decoding

ActiveUS20150043641A1Reduce decreaseColor television with pulse code modulationHigh-definition color television with bandwidth reductionImage resolutionVideo encoding
A video coding or decoding method using inter-image prediction to encode input video data in which each chrominance component has 1/Mth of the horizontal resolution of the luminance component and 1/Nth of the vertical resolution of the luminance component, where M and N are integers equal to 1 or more, comprises: storing one or more images preceding a current image; interpolating a higher resolution version of prediction units of the stored images so that the luminance component of an interpolated prediction unit has a horizontal resolution P times that of the corresponding portion of the stored image and a vertical resolution Q times that of the corresponding portion of the stored image, where P and Q are integers greater than 1; detecting inter-image motion between a current image and the one or more interpolated stored images so as to generate motion vectors between a prediction unit of the current image and areas of the one or more preceding images; and generating a motion compensated prediction of the prediction unit of the current image with respect to an area of an interpolated stored image pointed to by a respective motion vector; in which the interpolating step comprises: applying a ×R horizontal and ×S vertical interpolation filter to the chrominance components of a stored image to generate an interpolated chrominance prediction unit, where R is equal to (U×M×P) and S is equal to (V×N×Q), U and V being integers equal to 1 or more; and subsampling the interpolated chrominance prediction unit, such that its horizontal resolution is divided by a factor of U and its vertical resolution is divided by a factor of V, thereby resulting in a block of MP×NQ samples.
Owner:SONY CORP

Three-dimensional target detection method and device based on multi-sensor information fusion

The invention discloses a three-dimensional target detection method, apparatus and device based on multi-sensor information fusion, and a computer readable storage medium. The three-dimensional targetdetection method comprises the steps: fusing 3D point cloud and an RGB image collected by a laser radar and a camera sensor, and generating an RGB-I image; generating a multi-channel aerial view according to the 3D point cloud so as to determine a region of interest; respectively extracting and fusing region-of-interest features of the RGB-I image and the aerial view based on a convolutional neural network; utilizing a multi-layer perceptron to fuse the confidence coefficient, the approximate position and the size of the image prediction target based on the features of the region of interest,and determining a candidate box; adaptively endowing different pixel weights to different sensor candidate box feature maps based on an attention mechanism, and carrying out skip fusion; and processing the candidate frame feature fusion image by using a multi-layer perceptron, and outputting a three-dimensional detection result. According to the three-dimensional target detection method, apparatus and device, and the computer readable storage medium provided by the invention, the target recognition rate is improved, and the target can be accurately positioned.
Owner:CHANGCHUN INST OF OPTICS FINE MECHANICS & PHYSICS CHINESE ACAD OF SCI

Data encoding and decoding

ActiveUS20150063457A1Color television with pulse code modulationHigh-definition color television with bandwidth reductionImage resolutionVideo encoding
A video coding or decoding method using inter-image prediction to encode input video data in which each chrominance component has 1 / Mth of the horizontal resolution of the luminance component and 1 / Nth of the vertical resolution of the luminance component, where M and N are integers equal to 1 or more, comprises: storing one or more images preceding a current image; interpolating a higher resolution version of prediction units of the stored images so that the luminance component of an interpolated prediction unit has a horizontal resolution P times that of the corresponding portion of the stored image and a vertical resolution Q times that of the corresponding portion of the stored image, where P and Q are integers greater than 1; detecting inter-image motion between a current image and the one or more interpolated stored images so as to generate motion vectors between a prediction unit of the current image and areas of the one or more preceding images; and generating a motion compensated prediction of the prediction unit of the current image with respect to an area of an interpolated stored image pointed to by a respective motion vector; in which the interpolating step comprises: applying a xR horizontal and xS vertical interpolation filter to the chrominance components of a stored image to generate an interpolated chrominance prediction unit, where R is equal to (U×M×P) and S is equal to (V×N×Q), U and V being integers equal to 1 or more; and subsampling the interpolated chrominance prediction unit, such that its horizontal resolution is divided by a factor of U and its vertical resolution is divided by a factor of V, thereby resulting in a block of MP×NQ samples.
Owner:SONY CORP

Visual SLAM method based on point-line fusion

The invention discloses a visual SLAM method based on point-line fusion, and the method comprises the steps: firstly inputting an image, predicting the pose of a camera, extracting a feature point ofthe image, and estimating and extracting a feature line through the time sequence information among a plurality of visual angles; and matching the feature points and the feature lines, tracking the features in front and back frames, establishing inter-frame association, optimizing the pose of the current frame, and optimizing the two-dimensional feature lines to improve the integrity of the feature lines; judging whether the current key frame is a key frame or not, if yes, adding the key frame into the map, updating three-dimensional points and lines in the map, performing joint optimization on the current key frame and the adjacent key frame, and optimizing the pose and three-dimensional characteristics of the camera;and removing a part of external points and redundant key frames; and finally, performing loopback detection on the key frame, if the current key frame and the previous frame are similar scenes, closing loopback, and performing global optimization once to eliminate accumulated errors. Under an SLAM system framework based on points and lines, the line extraction speed and the feature line integrity are improved by utilizing the sequential relationship of multiple view angle images, so that the pose precision and the map reconstruction effect are improved.
Owner:BEIJING UNIV OF TECH

Image prediction/encoding device, image prediction/encoding method, image prediction/encoding program, image prediction/decoding device, image prediction/decoding method, and image prediction decoding

It is possible to provide an image prediction/encoding device, an image prediction/encoding method, an image prediction/encoding program, an image prediction/decoding device, an image prediction/decoding method, and an image prediction/decoding program which can select a plurality of candidate prediction signals without increasing an information amount. A weighting device (234) and an adder (235)perform a process (averaging, for example) of pixel signals extracted by a prediction adjacent region acquisition device (232) by using a predetermined synthesis method so as to generate a comparative signal for an adjacent pixel signal for each of combinations. A comparison/selection device (236) selects a combination having a high correlation between a comparison signal generated by the weighting device (234) and an adjacent pixel signal acquired by an object adjacent region acquisition unit (233). A prediction region acquisition unit (204), a weighting unit (205), and an adder (206) generate a candidate prediction signal and process it by using the predetermined synthesis method so as to generate a prediction signal.
Owner:NTT DOCOMO INC

High-resolution remote sensing image impervious surface extraction method and system based on deep learning and semantic probability

ActiveCN108985238AGet goodReasonable impervious surface extraction resultsMathematical modelsEnsemble learningConditional random fieldSample image
A high-resolution remote sensing image impervious surface extraction method and system based on deep learning and semantic probability. The method includes: obtaining a high-resolution remote sensingimage of a target region, normalizing image data, dividing the image data into a sample image and a test image; constructing a deep convolutional network, wherein the deep convolutional network is composed of a multi-layer convolution layer, a pooling layer and a corresponding deconvolution and deconvolution layer, and extracting image features of each sample image; predicting each sample image pixel by pixel, and constructing a loss function by using the error between the predicted value and the true value, and updating and training the network parameters; extracting the test image features by the deep convolutional network, and carrying out the pixel-by-pixel classification prediction, then constructing a conditional random field model of the test image by using the semantic associationinformation between pixel points, optimizing the test image prediction results globally, and obtaining the extraction results. The invention can accurately and automatically extract the impervious surface of the remote sensing image, and meets the practical application requirements of urban planning.
Owner:WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products