Text row arrangement analytical method and device for text area recognition

A text line arrangement analysis technology for text area recognition, applied in the field of image processing. It addresses the problem that text lines containing few characters cannot be distinguished from noise regions and therefore cannot be accurately extracted, and it improves the accuracy of text recognition.

Active Publication Date: 2014-04-16
ALIBABA GRP HLDG LTD

AI Technical Summary

Problems solved by technology

[0006] However, in the course of realizing the present invention, the inventors found that the existing text line arrangement analysis method based on region aggregation has at least the following problem: in manually edited images, it still cannot distinguish text lines containing few characters from noise regions, and therefore cannot accurately extract text lines with a small number of characters.

Examples

Embodiment 1

[0039] See Figure 2, which is a flowchart of the method for analyzing text line arrangement in text area recognition disclosed in Embodiment 1 of the present invention. The method includes the following steps:

[0040] Step 201: Extract multiple single-word regions from the image, and match any two single-word regions with the closest features into a candidate pair, yielding one or more candidate pairs;

[0041] It should be noted that "single word" in "single-word region" is a general term covering Arabic numerals and characters in various languages. For example, in the word "Promotion" shown in Figure 1, the characters rendered as "Promotion" and "sale" are each a single word; "2011" in "2011 Special" is a single word; and the characters rendered as "Special" and "Price" are each a single word. In addition, the present invention does not limit the method used to extract the single-word regions from the manually edited image; any method in the prior art can be used.

[0042] After extracting the word area from the manually ...
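Step 201 can be illustrated with a minimal sketch. The excerpt does not say which features are compared or how closeness is measured, so the `Region` layout, the `feature_distance` measure (centre distance plus size difference), and the function `match_candidate_pairs` are all assumptions made for illustration, not the patented method.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Region:
    """Bounding box of one extracted single-word region (hypothetical layout)."""
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height


def feature_distance(a: Region, b: Region) -> float:
    """Assumed closeness measure: centre distance plus size differences."""
    ax, ay = a.x + a.w / 2, a.y + a.h / 2
    bx, by = b.x + b.w / 2, b.y + b.h / 2
    return hypot(ax - bx, ay - by) + abs(a.h - b.h) + abs(a.w - b.w)


def match_candidate_pairs(regions: list[Region]) -> set[tuple[int, int]]:
    """Pair each region with its closest neighbour; pairs are (i, j) with i < j."""
    pairs = set()
    for i, a in enumerate(regions):
        best = min(
            (j for j in range(len(regions)) if j != i),
            key=lambda j: feature_distance(a, regions[j]),
            default=None,  # a lone region has no partner
        )
        if best is not None:
            pairs.add((min(i, best), max(i, best)))
    return pairs


# Three aligned boxes plus one distant, larger box standing in for noise.
regions = [
    Region(0, 0, 10, 12),
    Region(12, 0, 10, 12),
    Region(24, 1, 10, 12),
    Region(200, 90, 30, 40),
]
pairs = match_candidate_pairs(regions)
```

In this sketch the distant box is still paired with its nearest neighbour; pairing alone does not reject noise, which is why the later aggregation and selection steps matter.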

Embodiment 2

[0078] To better distinguish text areas from non-text areas and improve the accuracy of line arrangement analysis, Embodiment 2 differs from Embodiment 1 in that, before text lines are selected from the candidate lines, more features are extracted from the candidate lines, and one or more single-word regions that do not meet specific features are first deleted from the candidate lines. See Figure 5, which is a flowchart of another method for analyzing text line arrangement in text area recognition, disclosed in Embodiment 2 of the present invention. The method includes the following steps:

[0079] Step 501: Extract multiple single-word regions from the image, and match any two single-word regions with the closest features into a candidate pair, yielding one or more candidate pairs;

[0080] Step 502: Among the matched candidate pairs, aggregate those that meet preset aggregation conditions into a candidate group;

[0081] Step 503: Construct the ...
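The aggregation in Step 502 can be sketched as follows. The excerpt does not state the preset aggregation conditions, so the condition used here (two pairs share a region and their directions differ by at most `max_angle_diff` degrees) is purely an assumption, as are the names `pair_angle` and `aggregate_pairs`.

```python
from math import atan2, degrees


def pair_angle(centres, pair):
    """Undirected angle, in [0, 180), of the line joining a pair's centres."""
    (x1, y1), (x2, y2) = centres[pair[0]], centres[pair[1]]
    return degrees(atan2(y2 - y1, x2 - x1)) % 180.0


def aggregate_pairs(centres, pairs, max_angle_diff=10.0):
    """Group candidate pairs with a union-find structure.

    Assumed condition (not from the source): two pairs are merged when they
    share a region and their directions differ by at most max_angle_diff.
    """
    pairs = sorted(pairs)
    parent = list(range(len(pairs)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for a in range(len(pairs)):
        for b in range(a + 1, len(pairs)):
            shares_region = bool(set(pairs[a]) & set(pairs[b]))
            diff = abs(pair_angle(centres, pairs[a]) - pair_angle(centres, pairs[b]))
            diff = min(diff, 180.0 - diff)  # angles wrap around at 180 degrees
            if shares_region and diff <= max_angle_diff:
                parent[find(a)] = find(b)

    groups = {}
    for i, p in enumerate(pairs):
        groups.setdefault(find(i), []).append(p)
    return sorted(groups.values())


# Centres of four regions: three roughly on one line, one far off the line.
centres = [(5, 6), (17, 6), (29, 7), (215, 110)]
groups = aggregate_pairs(centres, {(0, 1), (1, 2), (2, 3)})
```

With these example centres, the two nearly horizontal pairs end up in one group and the steeply angled pair involving the distant region is left on its own.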

Embodiment 3

[0098] To further distinguish text areas from non-text areas and improve the accuracy of line arrangement analysis, Embodiment 3 differs from Embodiment 2 in that, after the single-word regions whose top, bottom, height, width, or spacing does not meet the corresponding preset threshold are deleted from the candidate lines, the sum of the complexity of the single-word regions in each candidate line is further calculated, and one or more candidate lines are deleted according to the calculated complexity. See Figure 7, which is a flowchart of another method for analyzing text line arrangement in text area recognition, disclosed in Embodiment 3 of the present invention. The method includes the following steps:

[0099] Step 701: Extract multiple single-word regions from the image, and match any two single-word regions with the closest features into a candidate pair, yielding one or more candidate pairs;

[0100] Step 702: among the matched candi...
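The region filtering described for Embodiments 2 and 3 can be sketched as below. The excerpt names the features (top, bottom, height, width, spacing) but not the thresholds, so this sketch assumes a region is kept only if each of its top, bottom, height, and width lies within `tolerance` times the median height of the candidate line's own medians. Spacing and the complexity sum are left out because the excerpt does not define them. The function name `filter_row` and the `(x, y, w, h)` box layout are hypothetical.

```python
from statistics import median


def filter_row(row, tolerance=0.5):
    """Drop boxes that deviate too far from the candidate line's medians.

    row: list of (x, y, w, h) boxes. The threshold rule is an assumption,
    not taken from the source.
    """
    if not row:
        return []
    median_height = median(b[3] for b in row)
    medians = {
        "top": median(b[1] for b in row),
        "bottom": median(b[1] + b[3] for b in row),
        "height": median_height,
        "width": median(b[2] for b in row),
    }
    limit = tolerance * median_height

    def within_limits(box):
        features = {
            "top": box[1],
            "bottom": box[1] + box[3],
            "height": box[3],
            "width": box[2],
        }
        return all(abs(features[k] - medians[k]) <= limit for k in medians)

    return [b for b in row if within_limits(b)]


# Three similar boxes and one oversized box that should be removed.
row = [(0, 0, 10, 12), (12, 0, 10, 12), (24, 1, 10, 12), (36, 0, 30, 40)]
kept = filter_row(row)
```

Using medians rather than means keeps a single oversized region from shifting the reference values it is compared against.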

Abstract

The embodiment of the invention discloses a text row arrangement analysis method and device for text area recognition. The method comprises the following steps: single-word regions extracted from an image are matched in pairs to form candidate pairs, the two single-word regions of each candidate pair having the most similar features; among the candidate pairs formed by matching, those meeting preset aggregation conditions are aggregated into a candidate set; a connected component is constructed from the candidate set, a maximum clique is searched for in the connected component, and the rows formed by the maximum clique serve as candidate rows; the initial directions of the baseline and the top line of each candidate row are calculated; the median spacing of the single-word regions in each candidate row is calculated according to the initial direction of that row, and the candidate row with the minimum median spacing is selected as a text row; the candidate pairs contained in the text row are deleted from the candidate set, and selection of new text rows continues. The method and device can accurately extract text rows containing few characters from a manually edited image, improving the accuracy of text recognition.
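Two of the steps in the abstract, the maximum clique search and the selection of the candidate row with the minimum median spacing, can be sketched as follows. This is an illustration under stated assumptions: the graph is given as a plain adjacency dictionary (the abstract does not say what the nodes and edges represent), the clique search uses the standard Bron-Kerbosch algorithm without pivoting (the source does not name an algorithm), and spacing is measured horizontally rather than along a computed row direction. The names `maximum_clique`, `median_gap`, and `pick_text_row` are hypothetical.

```python
from statistics import median


def maximum_clique(adj):
    """Largest clique via Bron-Kerbosch; adj maps node -> set of neighbours."""
    best = []

    def expand(r, p, x):
        nonlocal best
        if not p and not x:  # r is a maximal clique
            if len(r) > len(best):
                best = sorted(r)
            return
        for v in list(p):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return best


def median_gap(boxes):
    """Median horizontal gap between neighbouring (x, y, w, h) boxes."""
    ordered = sorted(boxes)
    gaps = [b[0] - (a[0] + a[2]) for a, b in zip(ordered, ordered[1:])]
    return median(gaps) if gaps else float("inf")


def pick_text_row(candidate_rows):
    """Select the candidate row whose median gap is smallest."""
    return min(candidate_rows, key=median_gap)


# Nodes 0, 1, 2 are mutually connected; node 3 hangs off node 2.
adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
clique = maximum_clique(adj)

candidate_rows = [
    [(0, 0, 10, 12), (12, 0, 10, 12), (24, 1, 10, 12)],  # tightly spaced
    [(24, 1, 10, 12), (200, 90, 30, 40)],                # widely spaced
]
text_row = pick_text_row(candidate_rows)
```

In the iterative scheme the abstract describes, the candidate pairs belonging to the selected row would then be removed from the candidate set and the selection repeated; that loop is omitted here.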

Description

Technical field

[0001] The invention relates to the field of image processing, and in particular to a method and device for analyzing text line arrangement in text area recognition.

Background art

[0002] Text recognition in images has a wide range of practical applications, such as content recognition of scanned documents or automatic postal code recognition. With the spread of digital cameras and the development of Internet technology, more and more images are produced by manually editing captured images. As shown in Figure 1, these manually edited images usually have complex backgrounds and varying foreground colors and textures. To recognize the text in such images, the text area must first be located and segmented. Here, "text" is a general term covering Arabic numerals and characters in various languages, such as Chinese characters or English letters.

[0003] In the process of po...

Application Information

IPC(8): G06K9/34, G06K9/62
Inventor 郑琪, 王永攀
Owner ALIBABA GRP HLDG LTD