Text detection model training method and device, readable storage medium and equipment

A text detection and training method technology, applied in neural learning methods, biological neural network models, character and pattern recognition, etc., can solve problems such as model training divergence, unstable training process, and large noise

Active Publication Date: 2021-04-20
BEIJING CENTURY TAL EDUCATION TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the division standard of difficult and easy samples of the FocalLoss method relies too much on the confidence of the model prediction. When a training sample is mislabeled, the adaptive weighting strategy of the FocalLoss method has a large weight difference between different samples, which is likely to cause unstable training. process, and even bring about the problem of model training divergence
Especially for the text detection task of the segmentation method, there is no clear texture boundary between the text area and the non-text area, and there must be a lot of noise in the labeling process. The FocalLoss method that relies on confidence to define difficult and easy samples is difficult to play a role.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text detection model training method and device, readable storage medium and equipment
  • Text detection model training method and device, readable storage medium and equipment
  • Text detection model training method and device, readable storage medium and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementation methods. It should be understood that the specific implementation manners described here are only used to explain relevant content, rather than to limit the embodiments of the present invention. It should also be noted that, for ease of description, only parts related to the embodiments of the present invention are shown in the drawings.

[0062] It should be noted that, in the case of no conflict, implementation manners and features in the implementation manners in the embodiments of the present invention may be combined with each other. Embodiments of the present invention will be described in detail below with reference to the drawings and in combination with implementation manners.

[0063] It should be noted that the numbering of the steps in the text is only for the convenience of explanation of the specific embodiments, and does ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a text detection model training method and device, a readable storage medium and equipment. The training method comprises the following steps: inputting a to-be-processed sample image into a convolutional network model to obtain a predicted value; obtaining an annotation value of the sample image; obtaining predicted loss according to the annotation value, the predicted value and the loss function; adjusting parameters of the convolutional network model according to the prediction loss, wherein the loss function comprises a simple sample judgment function and a weight coefficient function; the simple sample judgment function is used for filtering the sample images with the prediction confidence greater than a preset first threshold and the sample images with the prediction confidence less than a preset second threshold, and the weight coefficient function is used for adjusting the weight of the unfiltered sample images. According to the embodiment of the invention, a simple positive sample and a simple negative sample can be filtered, and the model can pay attention to a more valuable sample image in combination with the adjustment of the weight value.

Description

technical field [0001] The invention relates to the technical field of text detection model training, in particular to a text detection model training method, device, readable storage medium and equipment. Background technique [0002] In intelligent education scenarios, the positioning of image text areas is the pre-process for text recognition and content understanding, and the detection accuracy of text lines directly affects the processing effect of subsequent tasks. At present, text detection models based on deep learning are divided into two categories: regression methods based on preset boxes and pixel segmentation methods based on text regions. Among them, the pixel segmentation method based on the text area has strong adaptability, and has obvious advantages for slender text and curved text. In the text detection model based on the pixel segmentation method, the text segmentation task divides the image into text areas and non-text areas, which is a typical binary c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/20G06K9/62G06N3/04G06N3/08
Inventor 王德强刘霄熊泽法
Owner BEIJING CENTURY TAL EDUCATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products