Check patentability & draft patents in minutes with Patsnap Eureka AI!

Face attribute editing method based on reinforcement learning

A technology of reinforcement learning and attribute editing, which is applied in the field of face attribute editing based on reinforcement learning, can solve problems such as discontinuous attributes, and achieve the effect of saving computing costs

Pending Publication Date: 2021-05-14
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method is intuitively easier to understand than directly editing attributes in incomprehensible latent vectors, but creates discontinuities in attributes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Face attribute editing method based on reinforcement learning
  • Face attribute editing method based on reinforcement learning
  • Face attribute editing method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] Such as figure 1 As shown, a face attribute editing method based on reinforcement learning includes the following steps:

[0047] (1) Randomly initialize the agent and the environment and clear the experience playback collection;

[0048] (2) Obtain a vector representation of the desired training face in the latent space of the pre-trained generator model and use it as the initial state of the environment;

[0049] (3) Pass the initial state obtained in step (2) into the Actor's current network, and obtain a definite state-action value through the calculation of the Actor's current network, which is the attribute direction vector to be trained;

[0050] (4) Execute the state action value obtained in step (3), enter the next state, record whether the state is a termination state, and get the reward for executing the action;

[0051] (5) Store the previous state, execution action, reward, next state, and whether it is a termination state in steps (2)(3)(4) into the expe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a face attribute editing method based on reinforcement learning. The method comprises the following steps: obtaining an intermediate potential vector representation of a face in a pre-trained generator model space, and taking the intermediate potential vector representation as an initial state of reinforcement learning; inputting the obtained initial state into an Actor network, and calculating to obtain a determined action value as a direction vector of an editing attribute; adding gaussian noise gradually reduced along with the training process to the obtained action value to serve as a final action interacting with the environment; executing the adopted action for interaction to obtain a next state value; inputting the two state values into a generator model and converting the state values into a face image; utilizing a face attribute evaluator to obtain the attribute difference of the two faces and calculating an environment reward; and respectively updating parameters of the Actor network and the Critic network through gradient back propagation of the neural network by using the reward signal. According to the method, a multi-label data set required by training can be reduced, meanwhile, good attribute decoupling is provided, and the identity characteristics of an original face are kept to the maximum extent.

Description

technical field [0001] The invention relates to the technical field of computer vision, in particular to a face attribute editing method based on reinforcement learning. Background technique [0002] In reality, face attribute editing is mostly used in mobile-related face entertainment applications, such as checking one's appearance after old age, changing hairstyles, etc. In the development of deep learning, facial aging and facial makeup have become a new branch independent of face attribute editing. In addition to entertainment applications, they also have broad application prospects in case detection and face forgery. The difficulty of face attribute editing lies in two aspects: first, keep other attributes unchanged when editing a certain attribute; second, although there are many ways to edit face attributes, there are still challenges. [0003] The current mainstream face attribute editing methods are roughly divided into two types: model-based methods and additional...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/62G06N3/08
CPCG06N3/084G06V40/16G06F18/22G06F18/214
Inventor 谭晓阳任国伟
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More