Text detection model training method and device, computer equipment and storage medium

A text detection model training technique, applied in the field of image recognition, that addresses problems such as dense text, machine misrecognition, and large changes in text scale, and improves recognition accuracy.

Inactive Publication Date: 2020-04-28
上海眼控科技股份有限公司
Cites: 0; Cited by: 17

AI Technical Summary

Problems solved by technology

[0003] However, text detection and positioning in natural scenes and documents faces problems such as large changes in text scale, multiple directions, variable length and width, and dense text. In addition, because the environment for text detection and positioning is complex, multi-directional, curved, and inclined text in natural scenes often leads to machine misrecognition. Traditional rectangular regression therefore can no longer meet practical text detection and positioning needs.
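To illustrate why rectangular regression struggles with inclined text, the following minimal sketch compares the area of a tight four-point box around a rotated text line with the area of the axis-aligned rectangle that encloses it. The helper names (`rotate_rect`, `polygon_area`, `axis_aligned_box`) and the example dimensions are hypothetical and chosen only for illustration; they do not come from the patent.

```python
import math


def rotate_rect(cx, cy, w, h, angle_deg):
    """Return the four corners of a w-by-h rectangle rotated about its centre."""
    a = math.radians(angle_deg)
    cos_a, sin_a = math.cos(a), math.sin(a)
    corners = []
    for dx, dy in ((-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)):
        corners.append((cx + dx * cos_a - dy * sin_a, cy + dx * sin_a + dy * cos_a))
    return corners


def polygon_area(points):
    """Area of a simple polygon via the shoelace formula."""
    total = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def axis_aligned_box(points):
    """Smallest axis-aligned rectangle (x_min, y_min, x_max, y_max) around the points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return min(xs), min(ys), max(xs), max(ys)


# Assumed example: a 200 x 20 pixel text line tilted by 30 degrees.
text_quad = rotate_rect(100, 100, 200, 20, 30)
x0, y0, x1, y1 = axis_aligned_box(text_quad)

tight_area = polygon_area(text_quad)   # area actually covered by the text line
box_area = (x1 - x0) * (y1 - y0)       # area of the enclosing upright rectangle
fill_ratio = tight_area / box_area     # fraction of the rectangle that is text

print(f"tight area = {tight_area:.1f}, box area = {box_area:.1f}, fill ratio = {fill_ratio:.3f}")
```

In this assumed case the text occupies well under a quarter of the upright rectangle, so a rectangular box would pass mostly background, and possibly neighbouring text lines, to the recognition stage.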

Examples

Embodiment Construction

[0041] In order to make the purpose, technical solutions, and advantages of this application clearer, the following further describes this application in detail with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the application, and not used to limit the application.

[0042] At present, there are two main traditional text detection and positioning methods: one is based on pixel segmentation, and the other is based on anchor box regression. Pixel segmentation applies the idea of scene segmentation to detect and locate the text and thereby find its boundary and vertices, while polygon regression based on anchor box regression regresses the coordinates of four points or of two vertices. The above methods only regress more coordinates and increase the information of the network at...

Abstract

The invention relates to a text detection model training method and device, computer equipment, and a storage medium. The method comprises the following steps: acquiring a sample image with a text annotation box; performing feature extraction on the sample image to obtain a sample feature map; inputting the sample feature map into an initial text detection model to obtain a text prediction box of the sample image and a corresponding second coordinate matrix; determining a loss value of the second coordinate matrix corresponding to the text prediction box according to a preset loss function and the first coordinate matrix of the text annotation box; and training the initial text detection model according to the loss value. Because the sample images used during training carry polygonal text annotation boxes at arbitrary angles, the model can learn the directionality and scale change of text during training. When the model is then used for text positioning and detection, it can output a target text box at any angle that matches the text direction in the image, which improves the precision of subsequent machine recognition.
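The loss step in the abstract can be sketched as follows. The abstract only says "a preset loss function", so the smooth L1 loss used here is an assumption made for illustration, as are the function names (`smooth_l1`, `coordinate_matrix_loss`), the 4 x 2 layout of the coordinate matrices, and the assumption that predicted and annotated vertices are listed in the same order.

```python
def smooth_l1(diff, beta=1.0):
    """Smooth L1 penalty for one coordinate difference (assumed loss, not from the patent)."""
    d = abs(diff)
    if d < beta:
        return 0.5 * d * d / beta
    return d - 0.5 * beta


def coordinate_matrix_loss(pred, target):
    """Mean smooth L1 loss between two coordinate matrices.

    Each matrix is a list of (x, y) vertices; vertex i of `pred` is assumed
    to correspond to vertex i of `target`.
    """
    if len(pred) != len(target):
        raise ValueError("coordinate matrices must have the same number of vertices")
    total = 0.0
    count = 0
    for (px, py), (tx, ty) in zip(pred, target):
        total += smooth_l1(px - tx) + smooth_l1(py - ty)
        count += 2
    return total / count


# First coordinate matrix: vertices of the annotated (ground-truth) quadrilateral.
annotation = [(10.0, 20.0), (110.0, 40.0), (105.0, 65.0), (5.0, 45.0)]
# Second coordinate matrix: vertices of the predicted quadrilateral.
prediction = [(10.5, 20.0), (108.0, 40.0), (105.0, 65.0), (5.0, 45.0)]

loss = coordinate_matrix_loss(prediction, annotation)
print(f"loss = {loss:.6f}")
```

In a real training loop this value would be computed by a differentiable framework and back-propagated to update the model; the plain-Python version above only shows how a loss value could be derived from the two coordinate matrices.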

Description

Technical field

[0001] This application relates to the field of image recognition technology, and in particular to a training method and device, computer equipment, and a storage medium for a text detection model.

Background

[0002] With the development of artificial intelligence technology, more and more tedious tasks are being taken over by machines. For example, recognizing and entering the content of text images by hand is not only time-consuming and labor-intensive, but also prone to errors caused by fatigue and negligence. The positioning and detection of the text in a text image can therefore be completed automatically by a machine.

[0003] However, text detection and positioning in natural scenes and documents faces problems such as large changes in text scale, multiple directions, variable length and width, and dense text. In addition, the environment for text detection and positioning is more complicated, which makes multi-directional text and bending and tilting...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/00, G06K9/20, G06K9/46, G06K9/62, G06N3/04, G06N3/08
CPC: G06N3/084, G06V30/40, G06V10/225, G06V10/464, G06N3/045, G06F18/253
Inventor: 周康明, 谈咏东
Owner: 上海眼控科技股份有限公司