Unlock instant, AI-driven research and patent intelligence for your innovation.

Generation method of text detection model and text detection method

A text detection and model technology, applied in the field of text detection, can solve the problems of small amount of calculation, consumption of computing resources, complicated post-processing steps, etc., and achieve the effect of improving speed

Active Publication Date: 2021-09-21
BEIJING CENTURY TAL EDUCATION TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] At present, the common text detection methods roughly include two categories. One type of scheme is text detection based on sliding windows. Based on the idea of ​​general object detection, a large number of anchor boxes with different aspect ratios and different sizes are set, and the anchor boxes are used as sliding windows. Perform traversal search on the image or the feature map after the convolution operation of the image. For each searched position box, determine whether there is text in the box. The advantage of this method is that no complicated post-processing is required after the text box is determined. , the disadvantage is that the amount of calculation is large, it consumes computing resources and takes a long time; another kind of scheme is based on the method of calculating the connected domain (or based on the idea of ​​segmentation), using the full convolutional neural network model to extract image features, and performing Binarize and calculate its connected domain, and then use a specific method to determine the position of the text line according to different application scenarios (corresponding to different types of training data sets). The advantage of this method is that the calculation amount is small and the calculation speed is fast. The disadvantage is that the post-processing steps are cumbersome , involves a large number of calculations and optimizations in the post-processing stage. Whether the post-processing strategy is reasonable or not forms an important constraint on the overall performance of the algorithm, and there is a large room for optimization.
[0005] Regarding text detection, there are currently solutions such as "pixel aggregation network" and "differentiable binarization method". They have good performance in some open environment scene text detection, and the speed is very fast. In some specific dense text scenarios, the processing speed is greatly affected by the number of text boxes, and almost linearly decreases with the increase of the number of text boxes, which is difficult to meet the speed requirements of actual application scenarios.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Generation method of text detection model and text detection method
  • Generation method of text detection model and text detection method
  • Generation method of text detection model and text detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The principle and spirit of the present application will be described below with reference to several exemplary embodiments. It should be understood that these embodiments are given only to enable those skilled in the art to better understand and implement the present application, rather than to limit the scope of the present application in any way. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art. Herein, any number of elements in the drawings is used for illustration rather than limitation, and any designation is only for distinction and has no limiting meaning.

[0044] The principles and implementation methods of the present application are explained in detail below with reference to several embodiments of the present application.

[0045] Embodiments of the present application provide a method for generating a text detection model, a text dete...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present application provide a method for generating a text detection model, a text detection method, a device, a storage medium, and a computing device. The method for generating the text detection model includes: constructing a first sub-model to be trained, and using a plurality of marked text images as training data to train the first sub-model; constructing a second sub-model to be trained, and performing training on the trained The image output by the first sub-model is marked with the position category of the text line as the training data, and the second sub-model is trained. The parameters of the second residual network are kept unchanged during training, and the parameters of the two cascaded fully connected layers are Tuning, stop training after reaching the stop condition; cascade the trained first sub-model and the trained second sub-model to obtain a text detection model. Using the embodiment of the present application can realize rapid detection of dense text.

Description

technical field [0001] The present application relates to the technical field of text detection, and in particular to a method for generating a text detection model, a text detection method, a device, a storage medium, and a computing device. Background technique [0002] This section is intended to provide a background or context to the implementations of the application that are recited in the claims. The descriptions herein are not admitted to be prior art by inclusion in this section. [0003] The current "text detection" technology has a wide range of applications and is a pre-step for many computer vision tasks, such as image search, text recognition, identity authentication, visual navigation, etc. The main purpose of text detection is to locate the position of text lines or characters in the image . Compared with the detection of general objects, text has the characteristics of multi-direction, irregular shape, extreme aspect ratio, font, color, background, etc., s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/00G06K9/32G06K9/34G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V30/40G06V20/63G06V30/153G06V30/10G06N3/047G06N3/045G06F18/2415
Inventor 秦勇
Owner BEIJING CENTURY TAL EDUCATION TECH CO LTD