Text image processing method and device

A text image and processing method technology, applied in the field of character recognition, can solve the problems of broken strokes, split into one piece, low versatility and accuracy, and achieve the effect of improving accuracy and versatility

Active Publication Date: 2017-07-11
TENCENT TECH (SHENZHEN) CO LTD
View PDF7 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Existing character segmentation methods can segment characters to a certain extent, but they are often limited in practical applications. On the one hand, the projection segmentation method will cause multiple characters to be segmented into one piece when the characters themselves are inclined. , while template matching is less useful, because it can only be used in specific text occasions
[0005] O

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text image processing method and device
  • Text image processing method and device
  • Text image processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] Typical embodiments that embody the features and advantages of the present invention will be described in detail in the following description. It should be understood that the present invention is capable of various changes in different embodiments without departing from the scope of the present invention, and that the description and illustrations therein are illustrative in nature and not limiting. this invention.

[0033] As mentioned above, in the text image processing methods adopted by various text recognition applications, the realization of character segmentation is usually only accurate in specific scenarios, while the accuracy of character segmentation in other scenarios is low, which affects the text. Identify the accuracy of content recognition in applications.

[0034] Therefore, in order to ensure the versatility and accuracy, a text image processing method is proposed. The method is implemented by a computer program, and correspondingly, the constructed...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a text image processing method which comprises: preprocessing a text image to obtain a binary image and a plurality of connected domains included in the binary image; obtaining convex hulls corresponding to the plurality of connected domains and a character zone circumscribing the convex hulls by using a convex hull algorithm; and subjecting the obtained character zone to character segmentation in the horizontal direction to obtain a plurality of character blocks distributed in the binary image; and merging the character blocks according to the heights of the character blocks in the binary image to obtain word blocks included in the text image. In addition, there is provided a text image processing device matching the method. The text image processing method and device can improve the universality and the accuracy of character segmentation.

Description

technical field [0001] The invention relates to the technical field of character recognition, in particular to a text image processing method and device. Background technique [0002] In text image processing, character segmentation plays an extremely important role in the field of character recognition. It is mainly to segment characters at the location of characters on the basis of obtaining the text area of ​​the image. [0003] Existing character segmentation includes projection segmentation, clustering and template matching. Among them, the projection segmentation method will use the binarized image after image preprocessing to determine the area where the character is located through projection; the clustering method uses the connected area of ​​the character, and the character distribution characteristics of the entire page are used to determine the connected area. The character blocks of the region are merged; while the template matching method is mainly applied to ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/34G06V30/148G06V30/10
CPCG06V10/267G06V30/10G06V30/153G06V30/148G06V30/15G06V30/18086G06T7/11G06T7/60G06T11/60G06V10/26G06V30/414
Inventor 周龙沙王红法
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products