A method for removing logos and subtitles in images based on deep neural networks

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A deep neural network and image-based technology, applied in the field of removing logos and subtitles in images based on deep neural networks, can solve problems such as poor ability in complex scenes, large amount of calculation, and limitations of application scenarios, and achieve fitting Strong ability, good restoration results, and realistic visual effects

Active Publication Date: 2021-07-27

CHENGDU SOBEY DIGITAL TECH CO LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0010] The purpose of the present invention is: in order to solve the problems that the existing image restoration methods have poor ability to restore complex scenes in audio-visual images, require a large amount of calculation, and have limitations in application scenarios, the present invention provides a deep neural network-based The method of removing the logo and subtitles in the image, combining the characteristics of the convolutional neural network and the generation of the confrontation network, constructs the image repair model, and trains the image repair model with a large amount of data, and then removes the video frames that need to remove the logo and subtitles Bring in the image repair model for calculation, and automatically get the image frame without the logo and subtitles. The repair has strong real-time performance and a wide range of applications

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0050] Such as figure 1 As shown, this embodiment provides a method for removing the logo and subtitles in an image based on a deep neural network, including the following steps:

[0051] S1. Establish an image repair model: the image repair model is composed of a "U-net" network and a GAN, and the "U-net" network is used as a Generator of the GAN;

[0052] S2. Image preprocessing of the training set: crop or scale the images in the training set to a limited size to obtain the training image. In this embodiment, the length and width of the training image are limited to 512*512mm. According to the area where the station logo and subtitles are usually located, the The training images are logically partitioned as figure 2 Area 1, area 2 and area 3 are shown, wherein area 1 is the area where the station logo is located under normal conditions, area 2 is the area where subtitles are located under normal conditions, and corresponding Mask1 and Mask2 are generated in area 1 and are...

Embodiment 2

[0071] This embodiment is further optimized on the basis of the embodiment, specifically:

[0072] The class "U-net" network in S4 is composed of a convolutional layer and a deconvolution layer, and the processing flow of the class "U-net" network to the training image P1 and the training image P2 includes a downsampling process and an upsampling process, so In the down sampling process, the feature size is reduced by a convolution kernel with a step size of 2, and in the up-sampling process, the feature size is enlarged by a convolution kernel with a step size of 1 / 2; the class "U-net" network performs training image P1 When calculating with the training image P2, there is a ReLU activation function after each convolution and deconvolution operation.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method for removing logos and subtitles in an image based on a deep neural network, and relates to the technical field of image restoration. The invention includes the following steps: S1, establishing an image restoration model; S2, preprocessing training set images; S3 , processing training data: the training image is used as the real image Pt; the RGB value of the pixel in the Mask1 area in the training image is set to 0, as the training image P1; the RGB value of the pixel in the Mask2 area in the training image is set to 0, As the training image P2; S4, training the image restoration model to obtain the trained image restoration model; S5, image restoration: after preprocessing the image or video that needs to remove the logo and subtitles, input the trained image restoration model, and The image output by the image restoration model is combined with the original image to obtain the final image output. Based on the idea of deep learning, the present invention realizes automatic and rapid removal of the station logo and subtitles in the image.

Description

technical field [0001] The present invention relates to the technical field of image restoration, and more specifically relates to a method for removing a logo and subtitles in an image based on a deep neural network. Background technique [0002] With the rapid development of the Internet and mobile Internet, the total number of pictures and videos on the Internet and the playback time are constantly increasing. In 2017, the total daily video playback on YouTube alone exceeded 1 billion hours. There are station logos and subtitles in some video images, and the original video images cannot be obtained due to age or other reasons. How to remove the station logo and subtitles in real time and quickly in the existing video images to reduce the impact of the station logo or subtitles? Unfavorable interference of content, so that the audience can obtain a good visual experience is an urgent problem to be solved at present. [0003] Existing image restoration methods include the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06K9/32G06K9/34G06N3/04G06N3/08

CPCG06N3/08G06V20/635G06V10/267G06N3/045

Inventor 王炜李杰温序铭谢超平

Owner CHENGDU SOBEY DIGITAL TECH CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A method for removing logos and subtitles in images based on deep neural networks

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology