A Text Detection Post-processing Method Based on Geometric Features

A text detection and geometric feature technology, which is applied in image data processing, image analysis, instruments, etc., can solve the problems that the final prediction frame cannot reach the position, the processing is not in place, and the post-processing is less, so as to achieve the improvement effect and increase the accuracy rate , the effect of improving precision and recall

Active Publication Date: 2022-06-28
SOUTHWEST UNIV
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in practical applications, the final prediction frame is often not very accurate, so many algorithms also perform other post-processing, such as merging adjacent candidate frames
[0004] The current deep learning text detection method has less post-processing, and the obtained prediction frame can generally obtain better detection results, but there are also many details that have not been processed in place.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Text Detection Post-processing Method Based on Geometric Features
  • A Text Detection Post-processing Method Based on Geometric Features
  • A Text Detection Post-processing Method Based on Geometric Features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0089] In order to verify the effectiveness of the method of the present invention, the present invention conducts experiments on three data sets. The three datasets are: Canonical Yi dataset, Chinese2k dataset and English2k dataset. The canonical Yi dataset is manually annotated by the team, while the Chinese2k dataset and English2k dataset are publicly released datasets.

[0090] Canonical Yi dataset

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a text detection post-processing method based on geometric features. The post-processing method is used to post-process the original prediction frame. The original prediction frame is obtained by a deep learning text detection method. The post-processing method includes the following Steps: S1. Based on the background removal algorithm, remove the redundant background in the prediction frame without changing the intersection area of ​​the prediction frame and the character area; S2. Based on the candidate frame expansion algorithm, expand the obtained prediction frame in a certain order , so that it can completely extract the character area; S3, based on the non-standard frame removal algorithm, remove the non-standard prediction frame; S4, based on the repeated frame removal algorithm, remove the obtained repeated prediction frame to obtain the final prediction frame. This method can effectively solve the problems of character misrecognition, detection deviation, overlapping detection, etc. during text detection, and make the detection results more excellent.

Description

technical field [0001] The invention mainly relates to the related technical field of document image processing, in particular to a post-processing method for text detection based on geometric features. Background technique [0002] Document images are important carriers of information and play an important role in daily life. With the wide application of digitization in various fields, humans hope that machines can also imitate the ability of humans to read books, so the optical character recognition technology OCR (Optical Character Recognition) came into being. And text detection is an indispensable part of character recognition technology OCR, which is very important for subsequent text recognition. Efficient and accurate text detection has important applications in the field of document images, including character recognition systems, multilingual translation of images, and human-computer interaction. [0003] At present, the deep learning text detection method mainly...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06V10/22G06V10/774G06V30/414G06K9/62G06T7/11G06T7/136G06T7/194
CPCG06T7/194G06T7/136G06T7/11G06V30/414G06V10/22G06F18/214
Inventor 邱小刚赵富佳林小渝陈善雄李然康王定旺
Owner SOUTHWEST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products