Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

31results about How to "Improve objective quality" patented technology

Method for constructing convolutional neural network for video coding fractional pixel interpolation

The invention provides a method for constructing a convolutional neural network for video coding fractional pixel interpolation, which comprises the following steps: images with different content andresolution are collected, and an original training data set containing data with different types and coding complexity is formed; preprocessing operation is performed on the original training data setto obtain training data conforming to the video coding inter-frame prediction fractional pixel interpolation characteristic; a deep convolutional neural network is built to obtain a convolutional neural network structure suitable for the video coding inter-frame prediction fractional pixel interpolation; the pre-processed data is input into a built-up convolutional neural network; meanwhile, theoriginal training data set is used as a corresponding true value to train the built-up convolutional neural network. According to the method, the convolutional neural network can be successfully trained; the fractional pixels obtained by using the trained convolutional neural network interpolation meet the requirement for video coding fractional pixel interpolation characteristic; and the method in the invention is used for performing fractional pixel interpolation so that the video coding efficiency can be improved.
Owner:SHANGHAI JIAO TONG UNIV

Image interpolation method based on video object and area guidance

The invention discloses an image interpolation method basing on a video object and the regional guidance. The particular process follows like that: an original image is divided and the position and the region of an interpolation point are determined; to the interpolation point in the internal of a region, a one-dimensional linear interpolation formula is adopted for the evaluation and a two-dimensional nonlinear interpolation formula is adopted for the evaluation of the interpolation point on other positions when the interpolation point is positioned between two horizontal pixels or two vertical pixels at the original image, to the interpolation points in other positions, the two-dimensional nonlinear interpolation formula is used for the evaluation is evaluated when the interpolation point is positioned neither between the two horizontal pixels of the original image nor between the two vertical pixels of the original image; the obtained values of each pixel point are endowed to the pixel on the position of an interpolation point waiting to be interpolated to finish the image interpolation. The image interpolation method basing on the video object and the regional guidance is applicable to the transforming of the resolution of a video object or a whole image.
Owner:XIDIAN UNIV

Method for reconstructing distributed video coding based on constraints on temporal-spatial correlation of video

The invention discloses a method for reconstructing distributed video coding based on constraints on temporal-spatial correlation of video, which belongs to the technical field of video signal processing, and comprises the following steps: after completing the decoding of a Wyner-Ziv frame code stream, determining the pixel value or conversion coefficient value of the Wyner-Ziv frame into the range of [BL, BU] by a decoder; according to the information such as the statistic correlation between the Wyner-Ziv frame and the side information, the gradient information of the Wyner-Ziv frame in different directions, and grounds between adjacent frames, and the like, modeling the reconstruction of the Wyner-Ziv frame so as to solve the optimal solution problem of Markov random field; introducing the constraint on spatial correlation between adjacent pixels or conversion coefficients in sub-bands by defining energy functions, and calculating some parameters of the defined energy functions by analyzing and using the correlation between adjacent frames by grounds. In the invention, through introducing the constraints on temporal correlation and spatial correlation of the video signals, the subjective quality and objective quality of a reconstruction result can be simultaneously improved.
Owner:SHANGHAI JIAO TONG UNIV

Image super-resolution reconstruction method based on local regression model

The invention discloses an image super-resolution reconstruction method based on a local regression model. The method comprises the following steps: at first, carrying out Gaussian low pass filtering on an input low resolution image to obtain a low frequency band image thereof, carrying out bicubic interpolation to obtain an approximate low frequency band image of a high resolution image; then, applying a one-order regression model to each image block in the low frequency band image of the high resolution image during reconstruction, wherein a mapping function between high / low images in the regression model can be obtained by a machine learning method of an input image, namely, sampling corresponding positions of the input low resolution image and the low frequency band image thereof to obtain sampling image blocks of corresponding positions, and carrying out dictionary training; and finally, respectively applying the one-order regression model to non-local self-similar blocks of the reconstructed image blocks, and carrying out weighted integration to obtain reconstructed high resolution image blocks. By adopting the method provided by the invention, no external image model is required, a prior model is obtained by learning the input image, and the high resolution image reconstructed by the model has better subjective and objective reconstruction effects.
Owner:NANJING UNIV OF POSTS & TELECOMM

Differential image compression perception reconfiguration method based on multi-hypothesis weighting and intelligent terminal

ActiveCN107481293AAlleviate block effectExact iteration initial valueImage codingElastic networkPattern recognition
The invention belongs to the technical field of image encoding and decoding and discloses a differential image compression perception reconfiguration method based on multi-hypothesis weighting and an intelligent terminal. A cross-block-based block compression sensing process is used to sample an original image to obtain a measurement value. A non-local means fully differential iterative reconstruction algorithm is used to carry out iterative reconfiguration processing on the obtained measurement value, and an initial image reconstruction value is obtained. The multiple-hypothesis set acquisition processing is carried out on a current reconstruction value, and the optimized filtering processing is carried out on an obtained multi-hypothesis set to eliminate an inferior hypothesis. The optimized multi-hypothesis set is processed by using a weight estimation model based on an elastic network, a multi-hypothesis weight matrix is obtained, the weighted summation processing is carried out on the multiple hypotheses to obtain side information, and a more accurate iteration initial value is provided for subsequent iterations. According to the method and the intelligent terminal, the spatial correlation of the image is effectively used, the multi-hypothesis weighting processing is used to effectively relieve an over-smoothing problem of a past reconstruction algorithm, and the image reconstruction quality is greatly improved.
Owner:XIDIAN UNIV

Three-dimensional video coding method and device

The invention relates to a three-dimensional video coding method and a three-dimensional video coding device. The three-dimensional video coding method comprises the steps of: acquiring a virtual drawing block obtained through virtual drawing by a B video block and a corresponding coded or uncoded A video block, or a virtual viewpoint image block corresponding to the B video block, and regarding the virtual drawing block or the virtual viewpoint image block as a reference block; coding the B video block in a current coding mode to obtain a precoding B video block, and acquiring a reconstructed virtual drawing block obtained through virtual drawing by the precoding B video block and the corresponding coded or uncoded A video block; calculating space domain distortions and time domain distortions of the reference block and the reconstructed virtual drawing block, merging the space domain distortions and the time domain distortions to obtain a drawing distortion; loading a Lagrangian multiplier of a B video frame to obtain a precoding bit number of the B video block, and calculating a rate-distortion cost according to the drawing distortion, the Lagrangian multiplier and the precoding bit number; traversing codes of all coding modes, regarding the coding mode with the minimal rate-distortion cost as the optimal coding mode of the B video block; and acquiring a code of a next B video block until coding of the B video frames to be coded is completed. Therefore, the three-dimensional video coding efficiency is improved.
Owner:SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI

Method for dividing and rebuilding video coding predictive residue block

The invention provides a method for dividing and rebuilding a video coding predictive residue block, and belongs to the technical field of video coding in signal treatment. The invention aims to reduce the blocking effect due to the prior DCT transformation matrix and solve the problem that the value of the matrix cannot be adjusted. The division method comprises a predictive residue block classification step, a transverse division step of first, third and second multi-channel filter groups, a classification step of transverse division coefficient matrixes and a longitudinal division step of the first, the third and the second multi-channel filter groups; and the rebuilding method comprises a classification step of the division coefficient matrixes of the predictive residue block, a longitudinal rebuilding step of the first, the third and the second multi-channel filter groups; a classification step of the transverse division coefficient matrixes and a transverse rebuilding step of the first, the third and the second multi-channel filter groups. The method has the advantages of effectively removing correlation, reducing the blocking effect due to the size mismatching of the DCT transformation matrix and the predictive residue block, and improving subjective and objective qualities of coding.
Owner:HUAZHONG UNIV OF SCI & TECH

Video image high-quality transcoding method with excellent error code resistance

The invention provides a video image high-quality transcoding method with excellent error code resistance. According to the video image high-quality transcoding method, a fragmentation adaptive transcoding algorithm is provided for optimizing distortion-limited information source coding at a video image transcoding end, the fragmentation adaptive transcoding algorithm is based on a fragmentation technology of MPEG-4 advanced video coding, combines a distortion limiting information source coding theory, establishes a uniform fragmentation number limit distortion information source coding model,and adopts a strategy of adaptively adjusting the fragment number of each frame of image to consider the code rate and the fault-tolerant performance; and at a video image decoding end, a space-timerelated rapid stepping error masking correction algorithm is provided, on the basis of a rapid stepping image error recovery method, the space-time related rapid stepping error masking correction algorithm is used for correcting a video residual error, and recovers a whole macro block in combination with time domain information of a video image. Experimental results show that the two fault-tolerant algorithms can effectively enhance the fault-tolerant performance of the video image.
Owner:王程

Method for reconstructing distributed video coding based on constraints on temporal-spatial correlation of video

The invention discloses a method for reconstructing distributed video coding based on constraints on temporal-spatial correlation of video, which belongs to the technical field of video signal processing, and comprises the following steps: after completing the decoding of a Wyner-Ziv frame code stream, determining the pixel value or conversion coefficient value of the Wyner-Ziv frame into the range of [BL, BU] by a decoder; according to the information such as the statistic correlation between the Wyner-Ziv frame and the side information, the gradient information of the Wyner-Ziv frame in different directions, and grounds between adjacent frames, and the like, modeling the reconstruction of the Wyner-Ziv frame so as to solve the optimal solution problem of Markov random field; introducingthe constraint on spatial correlation between adjacent pixels or conversion coefficients in sub-bands by defining energy functions, and calculating some parameters of the defined energy functions by analyzing and using the correlation between adjacent frames by grounds. In the invention, through introducing the constraints on temporal correlation and spatial correlation of the video signals, the subjective quality and objective quality of a reconstruction result can be simultaneously improved.
Owner:SHANGHAI JIAO TONG UNIV

A Method of Image Super-resolution Reconstruction Based on Local Regression Model

The invention discloses an image super-resolution reconstruction method based on a local regression model. The method comprises the following steps: at first, carrying out Gaussian low pass filtering on an input low resolution image to obtain a low frequency band image thereof, carrying out bicubic interpolation to obtain an approximate low frequency band image of a high resolution image; then, applying a one-order regression model to each image block in the low frequency band image of the high resolution image during reconstruction, wherein a mapping function between high / low images in the regression model can be obtained by a machine learning method of an input image, namely, sampling corresponding positions of the input low resolution image and the low frequency band image thereof to obtain sampling image blocks of corresponding positions, and carrying out dictionary training; and finally, respectively applying the one-order regression model to non-local self-similar blocks of the reconstructed image blocks, and carrying out weighted integration to obtain reconstructed high resolution image blocks. By adopting the method provided by the invention, no external image model is required, a prior model is obtained by learning the input image, and the high resolution image reconstructed by the model has better subjective and objective reconstruction effects.
Owner:NANJING UNIV OF POSTS & TELECOMM

A Convolutional Neural Network Construction Method for Fractional Pixel Interpolation in Video Coding

The present invention provides a method for constructing a convolutional neural network for video coding fractional pixel interpolation, comprising: collecting images of different contents and resolutions to form original training data sets containing data of different types and coding complexity; The training data set is preprocessed to obtain training data that conforms to the characteristics of video coding inter-frame prediction fractional pixel interpolation; a deep convolutional neural network is built to obtain a convolutional neural network structure suitable for video coding inter-frame prediction fractional pixel interpolation; The processed data is input into the built convolutional neural network, and the original training data set is used as the corresponding true value to train the built convolutional neural network. The invention ensures that the convolutional neural network can be trained smoothly, and the fractional pixels obtained by using the trained convolutional neural network interpolation meet the characteristic requirements of video encoding fractional pixel interpolation, and the fractional pixel interpolation using the invention can realize the improvement of video encoding efficiency.
Owner:SHANGHAI JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products