Character image generation method guided by text based on generative adversarial network
An image generation and character technology, applied in the field of computer vision, can solve problems such as changing the pose and attributes of characters
Active Publication Date: 2019-07-16
ZHEJIANG UNIV
View PDF3 Cites 34 Cited by
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Some previous methods considered the problem of changing the pose of the character, and some methods aimed at text-image generation, while fewer methods considered changing the pose and attributes of the character according to the description of the text
Method used
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View moreImage
Smart Image Click on the blue labels to locate them in the text.
Smart ImageViewing Examples
Examples
Experimental program
Comparison scheme
Effect test
Embodiment
[0067] In this embodiment, the character pose generator and the character picture generator are learned according to the aforementioned steps S1-S5. The implementation methods of each step are as described above, and the specific steps will not be elaborated in detail. The following only shows the effect of the case data . This example is implemented in the CUHK-PEDES dataset with text annotations. The images are derived from 5 pedestrian re-identification datasets: CUHK03, Market-1501, SSM, VIPER and CUHK01, which contain a total of 40206 pictures of 13003 people.
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More PUM
Login to View More Abstract
The invention discloses a character image generation method based on text guidance of a generative adversarial network, and belongs to the field of computer vision. The method specifically comprises the following steps: obtaining a figure image data set for training, and defining an algorithm target; acquiring attitude information of all images in the character image data set, and acquiring basicattitudes from all attitude information through a clustering algorithm; learning from characters to a predicted posture by using a posture predictor based on the generative adversarial network; predicting a corresponding figure posture from the text by using the posture predictor obtained by learning in the steps S2 to S3; using a character picture generator based on a generative adversarial network ito learn character picture generation conforming to text description, and meanwhile, using a multi-modal error to establish a mapping relation between a picture sub-region and a text. The character image generation method based on text guidance of the generative adversarial network has good application value in scenes of image generation, image editing, pedestrian re-identification and the like.
Description
technical field [0001] The invention belongs to the field of computer vision, in particular to a method for generating character images guided by texts based on generative confrontation networks. Background technique [0002] Text-guided person image generation is defined as the problem of simultaneously changing the pose and attributes (e.g., clothing color) of a person in a reference image to match the text description, according to the target text description. In recent years, in the fields of computer vision tasks such as specific image generation, image retrieval, and person re-identification, generative methods have played an important role in generating images with specified content, expanding data sets, and increasing the robustness of algorithms. There are two key points in this task: the first is how to predict the target pose of the character from the text, the target pose should be consistent with the text description, and serve as a guide for the transformation ...
Claims
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More Application Information
Patent Timeline
Login to View More IPC IPC(8): G06T11/00G06N3/04G06N3/08
CPCG06T11/00G06N3/08G06N3/044G06N3/045
Inventor 周星然黄思羽李斌李英明张仲非
Owner ZHEJIANG UNIV



