Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

31results about How to "Improve objective quality" patented technology

Method for constructing convolutional neural network for video coding fractional pixel interpolation

The invention provides a method for constructing a convolutional neural network for video coding fractional pixel interpolation, which comprises the following steps: images with different content andresolution are collected, and an original training data set containing data with different types and coding complexity is formed; preprocessing operation is performed on the original training data setto obtain training data conforming to the video coding inter-frame prediction fractional pixel interpolation characteristic; a deep convolutional neural network is built to obtain a convolutional neural network structure suitable for the video coding inter-frame prediction fractional pixel interpolation; the pre-processed data is input into a built-up convolutional neural network; meanwhile, theoriginal training data set is used as a corresponding true value to train the built-up convolutional neural network. According to the method, the convolutional neural network can be successfully trained; the fractional pixels obtained by using the trained convolutional neural network interpolation meet the requirement for video coding fractional pixel interpolation characteristic; and the method in the invention is used for performing fractional pixel interpolation so that the video coding efficiency can be improved.
Owner:SHANGHAI JIAO TONG UNIV

Image interpolation method based on video object and area guidance

The invention discloses an image interpolation method basing on a video object and the regional guidance. The particular process follows like that: an original image is divided and the position and the region of an interpolation point are determined; to the interpolation point in the internal of a region, a one-dimensional linear interpolation formula is adopted for the evaluation and a two-dimensional nonlinear interpolation formula is adopted for the evaluation of the interpolation point on other positions when the interpolation point is positioned between two horizontal pixels or two vertical pixels at the original image, to the interpolation points in other positions, the two-dimensional nonlinear interpolation formula is used for the evaluation is evaluated when the interpolation point is positioned neither between the two horizontal pixels of the original image nor between the two vertical pixels of the original image; the obtained values of each pixel point are endowed to the pixel on the position of an interpolation point waiting to be interpolated to finish the image interpolation. The image interpolation method basing on the video object and the regional guidance is applicable to the transforming of the resolution of a video object or a whole image.
Owner:XIDIAN UNIV

A multi-sensor image fusion method and system based on GDGF

The invention belongs to the technical field of optical sensor image processing, and discloses a multi-sensor image fusion method and a system based on gradient domain guiding filter. The method smoothes a source image by using a mean filter, removes a small structure in the source image, and decomposes the source image to obtain a base layer and a detail layer of the source image. Laplace high-pass filter and Gaussian low-pass filter are used to filter the source image, and the salient image is obtained. The weight map of the corresponding source image is obtained by comparing the salient pixel size of the source image. The source image is used as the guide image and the weight map is decomposed by GDGF to get the basic layer and the detail layer of the weight map respectively. Accordingto certain fusion rules, the pixels corresponding to the base layer and the detail layer are fused by using the optimized weight map base layer and the detail layer, and the fused base layer and the detail layer are merged to obtain the fused image. The invention can effectively save the detail information in the source image, greatly improves the subjective and objective quality of the fused image, and has robustness to image registration.
Owner:LUOYANG NORMAL UNIV

Method for reconstructing distributed video coding based on constraints on temporal-spatial correlation of video

The invention discloses a method for reconstructing distributed video coding based on constraints on temporal-spatial correlation of video, which belongs to the technical field of video signal processing, and comprises the following steps: after completing the decoding of a Wyner-Ziv frame code stream, determining the pixel value or conversion coefficient value of the Wyner-Ziv frame into the range of [BL, BU] by a decoder; according to the information such as the statistic correlation between the Wyner-Ziv frame and the side information, the gradient information of the Wyner-Ziv frame in different directions, and grounds between adjacent frames, and the like, modeling the reconstruction of the Wyner-Ziv frame so as to solve the optimal solution problem of Markov random field; introducing the constraint on spatial correlation between adjacent pixels or conversion coefficients in sub-bands by defining energy functions, and calculating some parameters of the defined energy functions by analyzing and using the correlation between adjacent frames by grounds. In the invention, through introducing the constraints on temporal correlation and spatial correlation of the video signals, the subjective quality and objective quality of a reconstruction result can be simultaneously improved.
Owner:SHANGHAI JIAO TONG UNIV

Method and apparatus for improving video quality by utilizing a unified loop filter

The present invention relates to method and apparatus for improving video quality. The present invention provides a unified loop filter including: a pixel determining unit which determines the type of a pixel based on boundary strength; a similarity transforming unit which transforms a nonlinear filter into a nonlinear similarity-ordered statistics filter; and an integrating unit which integrates the nonlinear similarity-ordered statistics filter with a linear image filtering portion. The unified loop filter is applicable to filter reconstructed frames when an encoder or a decoder is processing a video signal.
Owner:HONG KONG APPLIED SCI & TECH RES INST

Image super-resolution reconstruction method based on local regression model

The invention discloses an image super-resolution reconstruction method based on a local regression model. The method comprises the following steps: at first, carrying out Gaussian low pass filtering on an input low resolution image to obtain a low frequency band image thereof, carrying out bicubic interpolation to obtain an approximate low frequency band image of a high resolution image; then, applying a one-order regression model to each image block in the low frequency band image of the high resolution image during reconstruction, wherein a mapping function between high / low images in the regression model can be obtained by a machine learning method of an input image, namely, sampling corresponding positions of the input low resolution image and the low frequency band image thereof to obtain sampling image blocks of corresponding positions, and carrying out dictionary training; and finally, respectively applying the one-order regression model to non-local self-similar blocks of the reconstructed image blocks, and carrying out weighted integration to obtain reconstructed high resolution image blocks. By adopting the method provided by the invention, no external image model is required, a prior model is obtained by learning the input image, and the high resolution image reconstructed by the model has better subjective and objective reconstruction effects.
Owner:NANJING UNIV OF POSTS & TELECOMM

AGF-based multi-focus image fusion method and system

The invention belongs to the technical field of optical image processing and discloses an AGF-based multi-focus image fusion method and system. The method comprises the steps of firstly smoothing an input image by using joint bilateral filtering, and alternately using a source image and a filtered image as the input image and a guide image of bilateral filtering; performing filtering processing onthe bilaterally filtered image by using median filtering to obtain a basic layer and a detail layer of the source image; calculating gradient energy of neighborhood windows of pixels of the basic layer and the detail layer of the source image, establishing a decision matrix according to the gradient energy of the neighborhood windows of the pixels of the basic layer and the detail layer, and fusing the pixels corresponding to the basic layer and the detail layer according to a certain fusion rule. The accuracy of judging a focus region in the source image can be effectively improved and the quality of a fused image can be greatly improved.
Owner:LUOYANG NORMAL UNIV

Improved sample adaptive offset filtering method based on histogram analysis

Provided is an improved sample adaptive offset filtering method based on histogram analysis. The method comprises the steps of analyzing histogram distribution according to gray values of sample values in coding tree blocks of a reconstruction frame, classifying the coding tree blocks according to the histogram distribution, dividing adaptive sample offset filtering into a narrow coding tree block mode, a wide coding tree block mode, a double-center coding tree block mode and a default mode, respectively calculating the optimal rate distortion cost values under different classification modes, selecting the mode corresponding to the minimum rate distortion cost value as a truly-adopted band filtering mode and coding and transmitting a corresponding band starting position and an offset value. Three more accurate and more efficient filtering classification methods are newly added according to the characteristics of coding tree block histogram distribution so as to improve the accuracy of the sample adaptive offset filtering method, and the subjective and objective quality of videos can be effectively improved under the condition of same code rate.
Owner:ACAD OF BROADCASTING SCI SARFT +1

Stereoscopic video whole frame loss error hiding method based on structural similarity

The invention discloses a stereoscopic video whole frame loss error hiding method based on structural similarity. The stereoscopic video whole frame loss error hiding method based on the structural similarity is effectively combined with subjective perceptions from human eyes to picture structure information, respectively uses a motion compensation forecasting method or a parallax compensation forecasting method to recover errors aiming at different macro block reference modes of macro blocks in a dropped frame by judging the macro block reference modes of reference image frames of a moment before the dropped frame, and further is intensively combined with the subjective perceptions from the human eyes to picture structure similarity due to the fact that time domain pertinence and pertinences among viewpoints of traditional stereoscopic video are fully considered, and therefore not only objective quality of recovering of the dropped frame can be improved, but also subjective quality of the recovering of the dropped frame can be enabled to be close to the perceptions of the human eyes.
Owner:NINGBO UNIV

Differential image compression perception reconfiguration method based on multi-hypothesis weighting and intelligent terminal

ActiveCN107481293AAlleviate block effectExact iteration initial valueImage codingElastic networkPattern recognition
The invention belongs to the technical field of image encoding and decoding and discloses a differential image compression perception reconfiguration method based on multi-hypothesis weighting and an intelligent terminal. A cross-block-based block compression sensing process is used to sample an original image to obtain a measurement value. A non-local means fully differential iterative reconstruction algorithm is used to carry out iterative reconfiguration processing on the obtained measurement value, and an initial image reconstruction value is obtained. The multiple-hypothesis set acquisition processing is carried out on a current reconstruction value, and the optimized filtering processing is carried out on an obtained multi-hypothesis set to eliminate an inferior hypothesis. The optimized multi-hypothesis set is processed by using a weight estimation model based on an elastic network, a multi-hypothesis weight matrix is obtained, the weighted summation processing is carried out on the multiple hypotheses to obtain side information, and a more accurate iteration initial value is provided for subsequent iterations. According to the method and the intelligent terminal, the spatial correlation of the image is effectively used, the multi-hypothesis weighting processing is used to effectively relieve an over-smoothing problem of a past reconstruction algorithm, and the image reconstruction quality is greatly improved.
Owner:XIDIAN UNIV

H.265-baased rate control method, system and device

The invention discloses an H.265-baased rate control method, system and device. The method comprises the following steps: judging whether a current frame is a scene switching frame; if yes, setting the current frame as an I frame, updating a quantization parameter according to the relation between the quantization parameter and a Lagrangian multiplier method coefficient, and carrying out target bit distribution on new GOP and new frame layers; if no, continuing to carry out target bit distribution on the new GOP and the new frame layers; calculating the quantization parameter by virtue of an R-Q model and a frame layer target bit distribution result; carrying out rate-distortion optimization according to the calculated quantization parameter, and selecting an optimal coding mode; coding inthe optimal coding mode; if the current frame is the last frame, ending the operation; and otherwise, updating model parameters. The system comprises modules (1-11). The device comprises a memory anda processor for executing the method. According to the method, the system and the device, the objective quality of scene switching videos can be improved; and the method, the system and the device can be widely applied to the video coding field.
Owner:GUANGZHOU HISON COMP TECH

Edge enhancement improved SPIHT image coding and decoding method

InactiveCN105828088AImprove objective qualitySave the amount of synchronous informationDigital video signal modificationDecoding methodsObjective quality
The present invention discloses an edge enhancement improved SPIHT image coding and decoding method. According to the method, the high-pass filtering is carried out on an image low frequency sub-band after wavelet transformation to extract the main contour and the edges of an image, so that on one hand, the synchronization between a coding terminal and a decoding terminal is realized by utilizing the lowest frequency sub-band, the important parameters can be positioned without needing to transmit the synchronization information additionally, and the synchronization information content of the conventional SPIHT coding is reduced; on the other hand, the priority degree of the coding and decoding is controlled according to a high pass filtering result of the low frequency sub-band, and the priority decoding is carried out on the main contour and the edges which are most sensitive to a human visual system in the image, and accordingly, the subjective and objective quality of the decoded images is improved.
Owner:LIAONING NORMAL UNIVERSITY

Three-dimensional video coding method and device

The invention relates to a three-dimensional video coding method and a three-dimensional video coding device. The three-dimensional video coding method comprises the steps of: acquiring a virtual drawing block obtained through virtual drawing by a B video block and a corresponding coded or uncoded A video block, or a virtual viewpoint image block corresponding to the B video block, and regarding the virtual drawing block or the virtual viewpoint image block as a reference block; coding the B video block in a current coding mode to obtain a precoding B video block, and acquiring a reconstructed virtual drawing block obtained through virtual drawing by the precoding B video block and the corresponding coded or uncoded A video block; calculating space domain distortions and time domain distortions of the reference block and the reconstructed virtual drawing block, merging the space domain distortions and the time domain distortions to obtain a drawing distortion; loading a Lagrangian multiplier of a B video frame to obtain a precoding bit number of the B video block, and calculating a rate-distortion cost according to the drawing distortion, the Lagrangian multiplier and the precoding bit number; traversing codes of all coding modes, regarding the coding mode with the minimal rate-distortion cost as the optimal coding mode of the B video block; and acquiring a code of a next B video block until coding of the B video frames to be coded is completed. Therefore, the three-dimensional video coding efficiency is improved.
Owner:SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI

Method for dividing and rebuilding video coding predictive residue block

The invention provides a method for dividing and rebuilding a video coding predictive residue block, and belongs to the technical field of video coding in signal treatment. The invention aims to reduce the blocking effect due to the prior DCT transformation matrix and solve the problem that the value of the matrix cannot be adjusted. The division method comprises a predictive residue block classification step, a transverse division step of first, third and second multi-channel filter groups, a classification step of transverse division coefficient matrixes and a longitudinal division step of the first, the third and the second multi-channel filter groups; and the rebuilding method comprises a classification step of the division coefficient matrixes of the predictive residue block, a longitudinal rebuilding step of the first, the third and the second multi-channel filter groups; a classification step of the transverse division coefficient matrixes and a transverse rebuilding step of the first, the third and the second multi-channel filter groups. The method has the advantages of effectively removing correlation, reducing the blocking effect due to the size mismatching of the DCT transformation matrix and the predictive residue block, and improving subjective and objective qualities of coding.
Owner:HUAZHONG UNIV OF SCI & TECH

Video image high-quality transcoding method with excellent error code resistance

The invention provides a video image high-quality transcoding method with excellent error code resistance. According to the video image high-quality transcoding method, a fragmentation adaptive transcoding algorithm is provided for optimizing distortion-limited information source coding at a video image transcoding end, the fragmentation adaptive transcoding algorithm is based on a fragmentation technology of MPEG-4 advanced video coding, combines a distortion limiting information source coding theory, establishes a uniform fragmentation number limit distortion information source coding model,and adopts a strategy of adaptively adjusting the fragment number of each frame of image to consider the code rate and the fault-tolerant performance; and at a video image decoding end, a space-timerelated rapid stepping error masking correction algorithm is provided, on the basis of a rapid stepping image error recovery method, the space-time related rapid stepping error masking correction algorithm is used for correcting a video residual error, and recovers a whole macro block in combination with time domain information of a video image. Experimental results show that the two fault-tolerant algorithms can effectively enhance the fault-tolerant performance of the video image.
Owner:王程

Stereo video bitrate control method based on binocular vision characteristic

The invention discloses a stereo video bitrate control method based on a binocular vision characteristic. The method is characterized in that bitrate control is conducted on a stereo image group layer, a stereo image pair layer and a frame layer respectively; in the stereo image group layer, the bit number of each stereo image group and a quantization parameter of a key frame are calculated; in the stereo image pair layer, the target bit number of each stereo image pair is calculated according to the surplus bit number and the buffer area saturation; in the frame layer, a stereo index bitrate and quantization parameter model is built according to the human eye binocular vision masking effect to optimize a stereo rate distortion model to enable more target bits to be distributed in a left image and fewer target bits to be distributed in a right image, and finally the quantization parameters of non key frames can be determined. The method has the advantage of effectively improving stereo video objective quality and rate distortion performance.
Owner:NINGBO UNIV

Three-dimensional image reduction method

The invention discloses a three-dimensional image reduction method which comprises the following steps: calculating gradients in the directions of 0 degree, 45 degrees and 135 degrees according to formula (1), formula (2) and formula (3), wherein weighting coefficients are exhaustively chosen from 10 groups of coefficients according to PSNR (peak signal to noise ratio); and calculating interpolations in the directions of 0 degree, 45 degrees and 135 degrees by adopting the principle of interpolating in the direction with the smallest gradient, wherein formula (1), formula (2) and formula (3) refer to the patent specification. As the three-dimensional image reduction method adopts the principle of interpolating in the direction with the smallest gradient, image objective quality and particularly the subjective quality are improved.
Owner:雷欧尼斯(北京)信息技术有限公司

Underwater image enhancement method based on unsupervised color correction

The invention belongs to the field of ocean engineering, and particularly relates to an underwater image enhancement method based on unsupervised color correction. According to the method, the limitation that the channel B is used as a protruding channel is improved, so that the color distortion problem of processing the non-blue-color-cast underwater image is effectively solved; The method comprises an RGB color model, a contrast correction method, an HIS color model and an underwater image enhancement method based on unsupervised color correction. Compared with a common underwater image enhancement method, the underwater image enhancement method has the advantages that the underwater image enhancement method is improved into a self-adaptive method and has good processing effects on different color cast; The image is effectively balanced, color cast is eliminated, the illumination is improved, and the true color is increased. The method is easy to implement, small in calculated amountand high in reliability, the feasibility of underwater optical image recognition is improved, and the method has positive significance for development of unmanned underwater vehicles in the aspects of underwater operation tasks and the like in the future.
Owner:HARBIN ENG UNIV

Ternary-representation-based image predictive coding method

The invention relates to a ternary-representation-based image predictive coding method, which is particularly suitable for the compression processing of static images. The method comprises the following steps of: performing wavelet transform and wavelet coefficient quantization on an image, representing each wavelet coefficient by using a ternary number and scanning a ternary wavelet coefficient plane; selecting a nearest-neighbor coefficient of a symbol currently to be coded as a prediction coefficient, defining an importance state function, an importance state direction weighting function and an importance state and function expression of the prediction coefficient for the characteristics of three symbols, and establishing a high-efficiency prediction model; and calculating a predicted value according to the prediction model, transmitting the symbol currently to be coded into a corresponding arithmetic coder for entropy coding, and resetting an initial value of the arithmetic coder between frequency bands. By the method, the high-efficiency predictive coding of wavelet coefficients is realized, and the objective quality of image recovery is effectively improved.
Owner:北京畅景立达软件技术有限公司

Image coding prediction method based on local minimum entropy

The invention relates to an image coding prediction method based on local minimum entropy. The method is especially suitable for compression processing of a static image. The method is characterized by: carrying out wavelet transformation and wavelet coefficient quantification to the image; selecting a wavelet coefficient which has a strong correlation with a bit to be coded as a prediction coefficient; defining an importance state function, an importance state direction weighting function and an importance state and a function of the prediction coefficient; taking reduction of entropy as a discrimination criteria, establishing a local optimum prediction model so as to classify data of the bit to be coded. By using the prediction model established in the invention, high efficient predictive coding of the wavelet coefficient can be realized. An experiment result shows that under a same compression ratio, objective quality of the recovery image can be effectively raised compared with a static image compression standard JPEG 2000.
Owner:北京畅景立达软件技术有限公司

Method for reconstructing distributed video coding based on constraints on temporal-spatial correlation of video

The invention discloses a method for reconstructing distributed video coding based on constraints on temporal-spatial correlation of video, which belongs to the technical field of video signal processing, and comprises the following steps: after completing the decoding of a Wyner-Ziv frame code stream, determining the pixel value or conversion coefficient value of the Wyner-Ziv frame into the range of [BL, BU] by a decoder; according to the information such as the statistic correlation between the Wyner-Ziv frame and the side information, the gradient information of the Wyner-Ziv frame in different directions, and grounds between adjacent frames, and the like, modeling the reconstruction of the Wyner-Ziv frame so as to solve the optimal solution problem of Markov random field; introducingthe constraint on spatial correlation between adjacent pixels or conversion coefficients in sub-bands by defining energy functions, and calculating some parameters of the defined energy functions by analyzing and using the correlation between adjacent frames by grounds. In the invention, through introducing the constraints on temporal correlation and spatial correlation of the video signals, the subjective quality and objective quality of a reconstruction result can be simultaneously improved.
Owner:SHANGHAI JIAO TONG UNIV

Ternary-representation-based image predictive coding method

The invention relates to a ternary-representation-based image predictive coding method, which is particularly suitable for the compression processing of static images. The method comprises the following steps of: performing wavelet transform and wavelet coefficient quantization on an image, representing each wavelet coefficient by using a ternary number and scanning a ternary wavelet coefficient plane; selecting a nearest-neighbor coefficient of a symbol currently to be coded as a prediction coefficient, defining an importance state function, an importance state direction weighting function and an importance state and function expression of the prediction coefficient for the characteristics of three symbols, and establishing a high-efficiency prediction model; and calculating a predicted value according to the prediction model, transmitting the symbol currently to be coded into a corresponding arithmetic coder for entropy coding, and resetting an initial value of the arithmetic coder between frequency bands. By the method, the high-efficiency predictive coding of wavelet coefficients is realized, and the objective quality of image recovery is effectively improved.
Owner:北京畅景立达软件技术有限公司

A three-primary color combined pre-equalization and deblurring underwater image enhancement method

The invention relates to an underwater image enhancement method combining pre-equalization and deblurring of three primary colors, which specifically includes: processing the image using a color-corrected histogram equalization method; reprocessing the corrected image by using a dark channel model; improving the image Estimation of background light; optimization of transmission map estimation; restoration of image scenes. The invention can obtain good visual effect and objective quality, and has the advantages of simple calculation, good restoration quality and the like.
Owner:DONGHUA UNIV

Three-dimensional image reduction method

The invention discloses a three-dimensional image reduction method which comprises the following steps: calculating gradients in the directions of 0 degree, 45 degrees and 135 degrees according to formula (1), formula (2) and formula (3), wherein weighting coefficients are exhaustively chosen from 10 groups of coefficients according to PSNR (peak signal to noise ratio); and calculating interpolations in the directions of 0 degree, 45 degrees and 135 degrees by adopting the principle of interpolating in the direction with the smallest gradient, wherein formula (1), formula (2) and formula (3) refer to the patent specification. As the three-dimensional image reduction method adopts the principle of interpolating in the direction with the smallest gradient, image objective quality and particularly the subjective quality are improved.
Owner:雷欧尼斯(北京)信息技术有限公司

Improved spiht image encoding and decoding method with edge enhancement

InactiveCN105828088BImprove objective qualitySave the amount of synchronous informationDigital video signal modificationDecoding methodsEngineering
The present invention discloses an edge enhancement improved SPIHT image coding and decoding method. According to the method, the high-pass filtering is carried out on an image low frequency sub-band after wavelet transformation to extract the main contour and the edges of an image, so that on one hand, the synchronization between a coding terminal and a decoding terminal is realized by utilizing the lowest frequency sub-band, the important parameters can be positioned without needing to transmit the synchronization information additionally, and the synchronization information content of the conventional SPIHT coding is reduced; on the other hand, the priority degree of the coding and decoding is controlled according to a high pass filtering result of the low frequency sub-band, and the priority decoding is carried out on the main contour and the edges which are most sensitive to a human visual system in the image, and accordingly, the subjective and objective quality of the decoded images is improved.
Owner:LIAONING NORMAL UNIVERSITY

A stereoscopic video rate control method based on binocular vision characteristics

The invention discloses a stereo video bitrate control method based on a binocular vision characteristic. The method is characterized in that bitrate control is conducted on a stereo image group layer, a stereo image pair layer and a frame layer respectively; in the stereo image group layer, the bit number of each stereo image group and a quantization parameter of a key frame are calculated; in the stereo image pair layer, the target bit number of each stereo image pair is calculated according to the surplus bit number and the buffer area saturation; in the frame layer, a stereo index bitrate and quantization parameter model is built according to the human eye binocular vision masking effect to optimize a stereo rate distortion model to enable more target bits to be distributed in a left image and fewer target bits to be distributed in a right image, and finally the quantization parameters of non key frames can be determined. The method has the advantage of effectively improving stereo video objective quality and rate distortion performance.
Owner:NINGBO UNIV

A Method of Image Super-resolution Reconstruction Based on Local Regression Model

The invention discloses an image super-resolution reconstruction method based on a local regression model. The method comprises the following steps: at first, carrying out Gaussian low pass filtering on an input low resolution image to obtain a low frequency band image thereof, carrying out bicubic interpolation to obtain an approximate low frequency band image of a high resolution image; then, applying a one-order regression model to each image block in the low frequency band image of the high resolution image during reconstruction, wherein a mapping function between high / low images in the regression model can be obtained by a machine learning method of an input image, namely, sampling corresponding positions of the input low resolution image and the low frequency band image thereof to obtain sampling image blocks of corresponding positions, and carrying out dictionary training; and finally, respectively applying the one-order regression model to non-local self-similar blocks of the reconstructed image blocks, and carrying out weighted integration to obtain reconstructed high resolution image blocks. By adopting the method provided by the invention, no external image model is required, a prior model is obtained by learning the input image, and the high resolution image reconstructed by the model has better subjective and objective reconstruction effects.
Owner:NANJING UNIV OF POSTS & TELECOMM

A rate control method for hevc coding unit level

The invention provides an HEVC encoding unit level code rate control method. An encoding unit SubCU further divided by LCU is used for carrying out level code rate control initialization, bit number allocation, encoding parameter estimation and other processes of P frame and B frame encoding units. At first, former LCU encoding information of the same level frame and at the same position is used for predicating whether the current frame LCU is divided. The LCU predicted to be divided, and an R-D model of the SubCU is used for estimating the complexity and the weight in the encoding unit level code rate control initialization part. In a bit number allocation phase, the state of an encoder and the complexity of the current LCU are respectively considered to calculate a target bit number of the current LCU. In an encoding parameter estimation phase, the R-D model of the SubCU is used for estimating the encoding parameter of the current LCU.
Owner:TONGJI UNIV

A Convolutional Neural Network Construction Method for Fractional Pixel Interpolation in Video Coding

The present invention provides a method for constructing a convolutional neural network for video coding fractional pixel interpolation, comprising: collecting images of different contents and resolutions to form original training data sets containing data of different types and coding complexity; The training data set is preprocessed to obtain training data that conforms to the characteristics of video coding inter-frame prediction fractional pixel interpolation; a deep convolutional neural network is built to obtain a convolutional neural network structure suitable for video coding inter-frame prediction fractional pixel interpolation; The processed data is input into the built convolutional neural network, and the original training data set is used as the corresponding true value to train the built convolutional neural network. The invention ensures that the convolutional neural network can be trained smoothly, and the fractional pixels obtained by using the trained convolutional neural network interpolation meet the characteristic requirements of video encoding fractional pixel interpolation, and the fractional pixel interpolation using the invention can realize the improvement of video encoding efficiency.
Owner:SHANGHAI JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products