Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text detection method of document image in natural scene

A natural scene and text detection technology, applied in the field of image processing, can solve problems such as lack of precision, long detection time, increased calculation amount, etc., to achieve the effect of enriching scenes, accurate results, and preventing model overfitting

Active Publication Date: 2018-01-19
BEIJING UNIV OF TECH
View PDF3 Cites 51 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] (1) The sliding window needs to traverse the image at multiple scales and judge each detection window, resulting in long detection time and low efficiency
[0010] (2) Lack of precision, difficult to deal with complex backgrounds
This method fails when the background is complex and the edge cannot be detected
At the same time, this method will mistakenly detect many objects with regular lines and similar to text, such as strips, grids, bricks, etc., as text
Unable to cope with the user's demand for shooting multiple scenes in a natural environment
[0012] MSER, does not work well with blurry, lighting, color, texture changes, low-contrast text
[0013] Both SWT and MSER methods detect a single character, and the detection result is inconvenient for the OCR module. It is necessary to merge the detected single characters according to the character spacing, height difference and other characteristics, which increases the amount of calculation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text detection method of document image in natural scene
  • Text detection method of document image in natural scene
  • Text detection method of document image in natural scene

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] Such as figure 1 As shown, the present invention provides a text detection method of a document image in a natural scene, comprising the following steps:

[0039] Step 1. Select 3816 commonly used Chinese characters, use different font types such as Song, Hei, Kai, and Lishu to make Chinese character pictures, and add a certain amount of salt and pepper noise and Gaussian noise to form data set 1. Among them, the training images in data set 1 For Chinese characters of different font types, the label is the designated label of the corresponding Chinese character.

[0040]Step 2. The neural network model has many parameters and requires a large amount of data training to prevent overfitting. Due to the high cost of labeled samples, the limited labeled samples need to be expanded. Randomly rotate the marked document image, and the rotation angle rotate∈[-30,30]. Random cropping, original image width, height, new image newWidth∈[0.7×width,width], newHeight∈[0.7×height,he...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text detection method of a document image in a natural scene. Commonly used Chinese characters are selected to make Chinese character pictures, a dataset 1 is formed, randomrevolving and cropping operations are carried out on labeled document images, then a manner of Poisson cloning is used to fuse different background images, and a dataset 2 is formed; the dataset 1 isadopted to carry out training of a text classification model on a VGG16 network, and after the model converges, obtained parameters are used to initialize a fully convolutional neural network model, and the dataset 2 is used to train the model; the trained fully-convolutional neural network model is used to process the image, a classification situation of each pixel point is obtained according toa maximum probability method, and a text-non-text binary image is formed; a method of connected regions is used to obtain text regions, the original image is binarized, and only text information in the text regions in the text-non-text region binary image is extracted to obtain a text binary image; the image is corrected through a maximum variance method; and projection is carried out again on thecorrected image, and the text-non-text region binary image is refined.

Description

technical field [0001] The invention belongs to an image processing method, in particular to a text detection method of a certificate image in a natural scene. Background technique [0002] The rapid development of Internet technology and the popularization of smart phones have greatly facilitated our lives. In many scenarios, operators need users to upload documents (such as ID cards, business licenses, etc.) to verify the user's identity and qualifications. The user's mobile phone takes a picture of the certificate and uploads it for verification, which is convenient and efficient. Due to the complex background and many environmental interference factors of users taking pictures in natural scenes. There are various shooting backgrounds in natural scenes. Users may shoot in possible life scenes such as desktops and bed sheets with complex textures. These textures are difficult to distinguish from text. There are also cases where the text is partially occluded in the phot...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/20G06K9/62G06N3/04
Inventor 张楠靳晓宁张文文段禹心贺思源
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products