An Efficient Reinforcement Learning Training Method for Video Coding Optimization

A reinforcement learning and video coding technology, applied in the field of video coding and reinforcement learning, can solve the problems of slow convergence, inability to obtain global optimal solutions, difficulty in generating labels with machine learning methods, etc., to achieve accelerated convergence, high policy utilization, The effect of increasing exploration range and probability

Active Publication Date: 2021-05-07
杭州微帧信息科技有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The invention solves the problem that the traditional method cannot obtain the global optimal solution, and the machine learning method is difficult to generate labels for training
Aiming at the problem that traditional reinforcement learning converges very slowly when there are many network parameters, this invention proposes a pre-training method to speed up the convergence of the algorithm

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Efficient Reinforcement Learning Training Method for Video Coding Optimization
  • An Efficient Reinforcement Learning Training Method for Video Coding Optimization
  • An Efficient Reinforcement Learning Training Method for Video Coding Optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The following examples will be used in conjunction with the accompanying drawings. The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0024] The high-efficiency reinforcement learning training method for video coding optimization of the present invention specifically optimizes each link in the reinforcement learning training process applied to video coding optimization, so as to speed up the convergence speed and enhance the system learning results, including the following steps,

[0025]Step (1), create a prediction network and a discriminative network. The prediction network is responsible for generating the optimal value of the coding strategy parameters; the discriminant network is responsible for discriminating whether the prediction value generated by the prediction network is good or bad. The prediction network and the discriminant netwo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an efficient reinforcement learning training method for video coding optimization, which is used to solve the problems of slow convergence speed and unstable training of reinforcement learning in the training process of optimizing video coding. The present invention obtains a better initial strategy by first adopting an evolutionary algorithm, and then adopts a supervised learning method to initialize reinforcement learning network parameters, thereby reducing the problem of slow convergence in the initial stage of training caused by random initialization. In the reinforcement learning training process, better strategies are saved, and good strategies are randomly sampled with a certain probability, reducing the problem of too many bad strategies caused by the blindness of reinforcement learning in the exploration process, and improving the convergence speed and stability of training sex. After a certain period of reinforcement learning training, the system may fall into a local optimum, resulting in limited improvement in video coding compression efficiency. The invention performs small-scale random disturbance on the specific parameters of the policy network at regular intervals, increases the range of system exploration, and further improves the compression efficiency of video coding.

Description

technical field [0001] The present invention relates to video coding and reinforcement learning, in particular to an efficient reinforcement learning training method for video coding optimization Background technique [0002] With the continuous development of multimedia digital video applications and the continuous improvement of people's demand for video cloud computing, the data volume of the original video source makes the existing transmission network bandwidth and storage resources unbearable. Therefore, the compression of video signal has become one of the hot spots of academic research and industrial application at home and abroad. Video compression, also known as video coding, aims to eliminate redundant information between video signals. So far, domestic and foreign standardization organizations have successively formulated a variety of different video coding standards. Since the H.261 video coding standard, the mainstream video coding standards have adopted the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04N19/176H04N19/124H04N19/147G06N3/08
CPCG06N3/08H04N19/124H04N19/147H04N19/176
Inventor 梅元刚陈宇金星朱政丁丹丹
Owner 杭州微帧信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products