Stroke width extraction method and device and character recognition method and system

A technology of stroke width and extraction method, which is applied in the field of character recognition to achieve the effect of ensuring speed and improving accuracy

Active Publication Date: 2013-12-18
ALIBABA GRP HLDG LTD
View PDF2 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] This application provides a stroke width extraction method and device to solve the problem of stroke extraction accuracy
[0007] Correspo

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Stroke width extraction method and device and character recognition method and system
  • Stroke width extraction method and device and character recognition method and system
  • Stroke width extraction method and device and character recognition method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] In order to make the above objects, features and advantages of the present application more obvious and comprehensible, the present application will be further described in detail below in conjunction with the accompanying drawings and specific implementation methods.

[0054] To solve the problem of stroke width extraction accuracy, the present application provides an erosion-based stroke width extraction method. This method extracts the contour perimeter of the connected components before each corrosion to form a perimeter histogram, and calculates the stroke length through the difference of the perimeter histogram, and the stroke width histogram is composed of the stroke length. The analysis of large values ​​identifies whether it is a text area, and then extracts the stroke width of the text area.

[0055] The following first introduces several concepts commonly used in image processing, as follows:

[0056] Binary image: refers to a digital image in which each pix...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a stroke width extraction method and device and a character recognition method and system and solves the problem of accuracy in stroke extraction. The stroke width extraction method includes: extracting an original connected component of a stroke and allowing the component to correspond to a candidate stroke width; subjecting to the original connected component to corrosion calculation, calculating outline perimeter of the connected component before each time of corrosion, and forming a perimeter histogram; using the candidate stroke width that the connected component corresponds to after each time of corrosion calculation to perform difference calculation on the perimeter histogram so as to obtain stroke length of each candidate stroke width; forming a stroke width histogram by the stroke lengths that the candidate stroke widths correspond to; determining whether the original connected component is a character region or not according to a maximum of the stroke width histogram; if yes, determining the stroke width of the character region according to the maximum. The stroke width extraction method and device and the character recognition method and system have the advantages that stroke width extraction can be more accurate and calculation speed is increased.

Description

technical field [0001] The present application relates to the technical field of character recognition, in particular to a stroke width extraction method and extraction device, and a character recognition method and recognition system. Background technique [0002] Text recognition technology in images has a wide range of applications, such as content recognition of scanned documents, automatic postal code recognition, etc. With the promotion of digital cameras and the development of Internet technology, there are more and more images captured in natural scenes and images generated by artificial editing. These images have complex background images, variable foreground colors and textures, and the text in them also has interferences such as multi-language, multi-font, non-linear arrangement, etc. In order to recognize the text in these complex images, it is first necessary to locate and cut the text area. [0003] Strokes are an important feature of text, and stroke width i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/46G06K9/20
Inventor 郑琪王永攀
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products