Unlock instant, AI-driven research and patent intelligence for your innovation.

A text-guided image restoration method and system

A repair method and text technology, applied in the field of image repair, can solve the problems of insufficient diversity of image repair, achieve the effect of improving accuracy and diversity, and improving performance

Active Publication Date: 2020-12-18
ZHEJIANG UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to solve the problem of lack of diversity in image repair in the prior art, introduce text-guided technology into the field of image repair, and propose a text-guided image repair method and system, using text description to guide the model to generate consistent semantics, Visually coherent and controllable images, a coarse-to-fine cross-modal generative network and a new supervisory signal guidance model are proposed to repair images step by step, and a text reconstruction module is used to guide the model to guide text based on the generated repair images Mask parts for prediction, improving visual-text consistency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A text-guided image restoration method and system
  • A text-guided image restoration method and system
  • A text-guided image restoration method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0104] The present invention is verified experimentally on two large data sets of CUB-200-2011 and Oxford-102, wherein, the data set of CUB-200-2011 contains 11788 images, including 200 kinds of birds of different categories; and the data of Oxford-102 The set contains 8189 images containing 102 different categories of flowers.

[0105] The present invention randomly divides them into disjoint training and test data sets, and randomly selects 3 titles for each image as text input; first scales the input image so that the smaller value of its height and width is 128 , and cut out a 128×128 image in the center as the source image, and generate a sequence of images to be repaired with a length of 4, The center mask size is 64×64, The blur radius is 4, 2, 1 in turn; for the input text, use NLTK for word segmentation, and use the pre-trained word2vec Glove of the cased-300d version for feature extraction.

[0106] In terms of comparison objects, since there is no similar resear...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text-guided image restoration technology method and system, belonging to the field of computer vision image restoration. It mainly includes the following steps: 1) For a set of image and guide text training set, through a coarse-to-fine image inpainting module, learn the joint representation of image information and text information and generate reasonable inpainted images. 2) For the image generated by the image inpainting module, a text reconstruction module is used to learn the semantic correlation between the inpainted image and the guiding text, and infer the mask part of the guiding text. The invention adopts the guiding text guidance model to generate semantically consistent, visually coherent and controllable images, adopts a coarse-to-fine cross-modal generation network and a new type of supervisory signal guidance model to gradually repair images, and uses a text reconstruction module to guide Based on the generated inpainted images, the model predicts the masked parts of the guide text, improving visual-text consistency.

Description

technical field [0001] The invention relates to the field of image restoration, in particular to a text-guided image restoration method and system. Background technique [0002] Image inpainting is a basic and important topic in the field of computer vision, which aims to complement the missing regions of a partially masked image and output a reasonable image. Most existing image completion methods complete missing regions by extending or borrowing information from surrounding regions, and these methods work well when missing regions are similar to their surrounding regions. However, the images generated by these methods will be unsatisfactory if there is not enough relevant information available in the surrounding area. [0003] In recent years, many artificially guided image inpainting techniques have been proposed, but most of them adopt structure-based methods, such as artificially adding boundary lines, delimiting reference areas and specifying extension directions, et...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06T5/00G06N3/08G06N3/02
CPCG06N3/08G06N3/02G06T2207/20084G06T2207/20081G06T5/77
Inventor 赵洲童鑫远蔡登何晓飞
Owner ZHEJIANG UNIV