Underlined text image preprocessing method and device

A text image and preprocessing technology, which is applied in the field of optical character recognition, can solve problems that affect the correct recognition of characters, cannot be correctly positioned, reduce the recognition rate of characters and the adaptability of the recognition core, and achieve improved recognition rate and strong adaptability Effect

Active Publication Date: 2012-05-09
HANVON CORP
View PDF3 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] This method is effective for the underline separated from the character, but for the case where the character and the underline are glued together, the straight line may not be correctly positioned

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Underlined text image preprocessing method and device
  • Underlined text image preprocessing method and device
  • Underlined text image preprocessing method and device

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0047] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0048] The following describes in detail the underlined text image preprocessing method of the present invention with reference to the accompanying drawings and taking the underline processing of English text line characters as an example.

[0049] Such as figure 1 Shown and referenced figure 2 , A specific embodiment of the underlined text image preprocessing method of the present invention includes the following steps:

[0050] Step 1: Re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an underlined text image preprocessing method and device, relating to the field of optical character recognition. The method comprises the following steps: acquiring the position of each text line in a text image; for the position of each text line, searching each text line based on a run-length search method; if the preliminary determination result shows that an underline exists in the text line, locating the position of the initial upper boundary of the underline; by using the initial upper boundary of the underline as an initial pixel line, locating the underline region based on run-length search and connected domain analysis methods; separating out stroke regions of characters from the underline region, thus obtaining a region to be deleted; and setting the foreground information in the region to be deleted into the background, thus obtaining a character region having no underline. By searching each text line based on the run-length search method for the position of each text line, the invention solves the problem that a text having an underline (especially an underline conglutinated with characters) is difficult to recognize, improves the character recognition rate, and enhances the adaptability of the recognition core.

Description

technical field [0001] The invention belongs to the field of optical character recognition (OCR), and relates to an underlined text image preprocessing method and device. Background technique [0002] In printed character recognition, the general processing flow is: first divide the text image into several lines, so that each text line contains only a single line of text; and then further character segmentation and recognition. [0003] If there is an underscore under the character, it will not only affect the normal segmentation of the character, but also cause the character recognition engine to fail to recognize the corresponding character correctly. Therefore, it is usually necessary to remove the underscore below the character before character segmentation and recognition. [0004] In the prior art, a simple straight line detection method (such as Hough transform, etc.) is usually used. If a long straight line is detected under the character image, the image in the lin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/34G06K9/20
Inventor 万鑫刘正珍
Owner HANVON CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products