Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Scene text segmentation method based on weak supervision deep learning

A technology of deep learning and text segmentation, applied in the field of image processing, to achieve the effect of broadening the scope of application, improving efficiency, and reducing algorithm costs

Active Publication Date: 2019-10-11
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF6 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The technical problem to be solved by the present invention is to provide a scene text segmentation method based on weakly supervised deep learning, based on a fully convolutional semantic segmentation network, which does not require any strong supervised pixel-level labeled samples, and solves the problem in scene text detection simply and efficiently. The problem of text segmentation greatly reduces the cost of the algorithm and improves the efficiency of scene text detection

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scene text segmentation method based on weak supervision deep learning
  • Scene text segmentation method based on weak supervision deep learning
  • Scene text segmentation method based on weak supervision deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0032] A method for scene text segmentation based on weakly supervised deep learning of the present invention, which is based on a fully convolutional semantic segmentation network, comprises the following steps:

[0033] Step 1: Generate training data

[0034] Superimpose the scene picture with any text to generate scene text picture training data, and label it as the scene picture itself.

[0035] In this example, multiple groups of life scene pictures were selected from random searches on the Internet, and a 224*224 background area was randomly cut out. After tool operation, Chinese and English text samples from any angle were added to the background area to obtain training scene text pictures. There are no special requirements for the required life scene pictures and text samples, the data sources are extremely wide, the data thr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a scene text segmentation method based on weak supervision deep learning, and the method comprises the following steps: enabling a scene picture to be superposed with any text to generate a scene text picture, generating a training sample, and labeling the training sample as the scene picture; performing feature extraction by using a convolutional neural network to obtain high-level semantics step by step; performing up-sampling through deconvolution to gradually recover the high-level semantic feature map to the size of the input image; carrying out multi-scale fusion on the feature maps output by the convolution layer and the deconvolution layer; activating the fused feature map to obtain a dichotomy black-and-white image of the scene and the text; setting a loss function for training; and corroding and expanding the scene text segmentation map obtained after training to generate a text region bounding box. The method does not need any strongly supervised pixel-level annotation sample, simply and efficiently solves the text segmentation problem in scene text detection, greatly reduces the algorithm cost, and improves the scene text segmentation efficiency.

Description

technical field [0001] The invention belongs to the field of image processing, and in particular relates to a scene text segmentation method based on weakly supervised deep learning. Background technique [0002] Extracting textual information from images of real-world scenes has become increasingly popular in recent years. Scene text detection (a technique for localizing text in natural scene images) plays an indispensable role in various text reading systems. Text detection in natural scene images is more complicated than general object detection. One of the main reasons is that the background of natural scene text images is extremely complex and diverse. For example, text can appear on flat, curved or wrinkled surfaces; near the text area There are complex interference textures, or non-text areas have text-like textures, etc. Because of the heavy noise, the algorithm has difficulty identifying text instances when locating bounding boxes. [0003] In the traditional met...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06T7/49G06T7/194G06T7/136G06T5/30G06K9/20G06N3/04G06N3/08
CPCG06T7/49G06T5/30G06T7/194G06T7/136G06N3/08G06V10/22G06N3/048G06N3/045
Inventor 杨路曹阳李佑华
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products