Efficient reinforcement learning training method for video coding optimization

A technology of reinforcement learning and video coding, which is applied in the field of video coding and reinforcement learning, can solve problems such as the inability to obtain the global optimal solution, difficulty in generating labels with machine learning methods, and slow convergence, so as to achieve high policy utilization and increase the scope of exploration and probability, the effect of speeding up the convergence

Active Publication Date: 2019-10-18
杭州微帧信息科技有限公司
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The invention solves the problem that the traditional method cannot obtain the global optimal solution, and the machine learning method is difficult to generate labels for training
Aiming at the problem that traditional reinforcement learning converges very slowly when there are many network parameters, this invention proposes a pre-training method to speed up the convergence of the algorithm

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Efficient reinforcement learning training method for video coding optimization
  • Efficient reinforcement learning training method for video coding optimization
  • Efficient reinforcement learning training method for video coding optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The following examples will be used in conjunction with the accompanying drawings. The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0024] The high-efficiency reinforcement learning training method for video coding optimization of the present invention specifically optimizes each link in the reinforcement learning training process applied to video coding optimization, so as to speed up the convergence speed and enhance the system learning results, including the following steps,

[0025]Step (1), create a prediction network and a discriminative network. The prediction network is responsible for generating the optimal value of the coding strategy parameters; the discriminant network is responsible for discriminating whether the prediction value generated by the prediction network is good or bad. The prediction network and the discriminant netwo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an efficient reinforcement learning training method for video coding optimization, which is used for solving the problems of low convergence rate and unstable training of reinforcement learning in the training process of video coding optimization. According to the method, firstly, a good initial strategy is obtained by adopting an evolutionary algorithm, and then reinforcement learning network parameters are initialized by adopting a supervised learning method, so that the problem of slow convergence of a training initial stage caused by random initialization is reduced. In the reinforcement learning training process, a good strategy is stored, the strategy is randomly sampled at a certain probability, the problem that too many bad strategies are caused by blindnessof reinforcement learning in the exploration process is reduced, and the convergence speed and stability of training are improved. After a certain stage of reinforcement learning training, the systemmay fall into local optimum, thereby causing limited improvement of video coding compression efficiency. According to the method, small-amplitude random disturbance is carried out on specific parameters of the strategy network at certain intervals, the system exploration range is widened, and the compression efficiency of video coding is further improved.

Description

technical field [0001] The present invention relates to video coding and reinforcement learning, in particular to an efficient reinforcement learning training method for video coding optimization Background technique [0002] With the continuous development of multimedia digital video applications and the continuous improvement of people's demand for video cloud computing, the data volume of the original video source makes the existing transmission network bandwidth and storage resources unbearable. Therefore, the compression of video signal has become one of the hot spots of academic research and industrial application at home and abroad. Video compression, also known as video coding, aims to eliminate redundant information between video signals. So far, domestic and foreign standardization organizations have successively formulated a variety of different video coding standards. Since the H.261 video coding standard, the mainstream video coding standards have adopted the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04N19/176H04N19/124H04N19/147G06N3/08
CPCG06N3/08H04N19/124H04N19/147H04N19/176
Inventor 梅元刚陈宇金星朱政丁丹丹
Owner 杭州微帧信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products