Video predictive encoding method based on neural networks
A neural network and predictive coding technology, applied in the field of video predictive coding based on neural network, can solve the problem of increasing coding complexity, and achieve the effect of improving coding efficiency, reducing complexity, and optimizing decision-making process.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0041] Such as Figure 4 with Figure 5 As shown, the present embodiment provides a neural network-based video predictive coding method, including the following steps:
[0042] S1. Input a coding tree unit with a size of 64×64, and use a Bayesian classifier to make a rough judgment on it to determine whether the SKIP mode is adopted. If so, determine that the current coding tree unit is not divided down, and directly obtain the coding tree unit The coding unit size decision of , otherwise, execute S2;
[0043] The judgment method of the Bayesian classifier is as follows:
[0044] Consider whether to use the SKIP mode as a two-category problem, and the two categories are marked as y 1 and y 2 , P(y j ) is the prior probability, and the conditional probability of the class is P(x|y j ), j is a mark of two categories, which can be 1 or 2, which means that SKIP is not executed or executed, P(y j |x) is the posterior probability, the calculation formula is:
[0045]
[0...
Embodiment 2
[0072] This embodiment is further optimized on the basis of Embodiment 1, specifically:
[0073] The three neural networks in the S2 are trained using the training data set, and the training method is as follows:
[0074] Step 1. Perform data augmentation preprocessing on the images in the training data set;
[0075] Step 2, performing 0-1 regularization on the preprocessed image data;
[0076] Step 3. The regularized image is input into the first neural network, the regularized image is divided into 4 equal parts and then input into the second neural network, and the regularized image is divided into 16 equal parts and then input into the third neural network. The neural network is trained;
[0077] In the step 1, the preprocessing of data augmentation on the images in the training data set specifically includes four kinds of image transformations, and the four kinds of image transformations are specifically:
[0078] a. Flip the image horizontally and vertically;
[0079...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com