Video bit enhancement method based on attention mechanism
An attention-based video technology, applicable to image enhancement, television, image analysis, and related fields. It addresses the problem of flicker between frames of high-bit-depth video sequences and improves perceived visual quality.
Examples
Embodiment 1
[0052] Embodiments of the present invention include the following steps:
[0053] 101: Randomly select 1000 groups of original video sequences, each containing 5 video frames, from the 16-bit Sintel database [9] and quantize them to 4-bit depth. Apply the zero-filling algorithm to expand each 4-bit video sequence into a 16-bit-depth video sequence. A 16-bit-depth video frame produced by zero-filling is called a coarse high-bit-depth video frame;
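The data preparation in step 101 can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions: quantization is taken to mean keeping the 4 most significant bits, zero-filling is taken to mean shifting those bits back up and padding the low 12 bits with zeros, and the function names and the random test frames are hypothetical stand-ins for the Sintel data.

```python
import numpy as np

def quantize_to_low_bit(frames16, low_bits=4, high_bits=16):
    """Assumed quantization: keep only the `low_bits` most significant bits."""
    shift = high_bits - low_bits
    return frames16.astype(np.uint16) >> shift

def zero_fill(frames_low, low_bits=4, high_bits=16):
    """Zero-filling: shift the low-bit codes up and pad the new low bits with 0."""
    shift = high_bits - low_bits
    return frames_low.astype(np.uint16) << shift

# Hypothetical stand-in for one group of 5 frames (random values, not Sintel data).
rng = np.random.default_rng(0)
group = rng.integers(0, 2**16, size=(5, 8, 8), dtype=np.uint16)

low = quantize_to_low_bit(group)   # 4-bit codes in the range 0..15
coarse = zero_fill(low)            # coarse 16-bit frames, low 12 bits all zero
```

Under these assumptions the coarse frame differs from the original by less than 2^12 per pixel, which is the information the network is expected to restore.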
[0054] 102: This embodiment uses an encoder-decoder as the basic network architecture. A global attention alignment module is added at the head of the encoder; by computing intra-frame and inter-frame correlations across the video sequence, it captures long-range dependencies and performs implicit motion estimation and motion compensation (ME&MC). A target-guided semantic attention module is added at the connection between the encoder and decoder, which fuses the s...
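To make the idea of computing intra-frame and inter-frame correlations concrete, the following is a generic self-attention sketch over all positions of all frames. It is not the module described in this patent: the function name, the feature layout (frames, height, width, channels), the scaled dot-product similarity, and the absence of learned projections are all assumptions chosen for brevity.

```python
import numpy as np

def global_attention(features):
    """Generic non-local attention over every position of every frame.

    features: array of shape (T, H, W, C), an assumed feature layout.
    Returns the aggregated features and the (T*H*W, T*H*W) weight matrix.
    """
    t, h, w, c = features.shape
    tokens = features.reshape(t * h * w, c)

    # Correlation between every pair of positions, within and across frames.
    scores = tokens @ tokens.T / np.sqrt(c)

    # Row-wise softmax, shifted by the row maximum for numerical stability.
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)

    # Each position becomes a weighted sum of all positions in all frames.
    aligned = weights @ tokens
    return aligned.reshape(t, h, w, c), weights

# Hypothetical feature maps for a group of 5 frames.
rng = np.random.default_rng(0)
feats = rng.standard_normal((5, 4, 4, 8))
aligned, weights = global_attention(feats)
```

Because every position can attend to every position in every frame, content that has moved between frames can still be matched without an explicit optical-flow step, which is one way to read "implicit" ME&MC.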
Embodiment 2
[0059] The effect of the scheme in Embodiment 1 is evaluated below using specific experimental data, as described in the following:
[0060] 301: Data composition
[0061] The test set consists of 50 groups of consecutive 16-bit video frames randomly selected from the Sintel database, none of which appear in the training set, and 30 groups of consecutive 16-bit video frames randomly selected from the TOS database. Each group contains 5 frames.
[0062] 302: Evaluation Criteria
[0063] The present invention mainly uses two metrics to evaluate the quality of the reconstructed high-bit-depth video frames:
[0064] Peak Signal-to-Noise Ratio (PSNR) is a commonly used objective image quality metric.
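As an illustration of the PSNR metric, here is a minimal sketch using the standard definition 10 * log10(peak^2 / MSE). The peak value of 2^16 - 1 for 16-bit frames and the function name are assumptions; the source does not state how the peak was chosen in its experiments.

```python
import numpy as np

def psnr(reference, reconstructed, bit_depth=16):
    """Standard PSNR in dB; the peak value is assumed to be 2**bit_depth - 1."""
    ref = reference.astype(np.float64)
    rec = reconstructed.astype(np.float64)
    mse = np.mean((ref - rec) ** 2)
    if mse == 0:
        return float("inf")  # identical frames
    peak = 2 ** bit_depth - 1
    return float(10.0 * np.log10(peak ** 2 / mse))

# Hypothetical frames: a flat reference and a copy that is off by one level.
ref_frame = np.zeros((8, 8), dtype=np.uint16)
off_by_one = np.ones((8, 8), dtype=np.uint16)
score = psnr(ref_frame, off_by_one)
```

A higher PSNR means the reconstructed frame is numerically closer to the 16-bit ground truth.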
[0065] The Structural Similarity Index (SSIM) [12] is a metric of the structural similarity of two images. This index measures the similarity of two images from the perspectives ...