Text row arrangement analytical method and device for text area recognition
A text area and line arrangement technology, applied in the field of image processing, can solve the problems of indistinguishable, few lines of text, and inability to accurately extract the number of text, and achieve the effect of improving the accuracy rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0039] See figure 2 , Which is a method flowchart of a method for analyzing character line arrangement in character region recognition disclosed in Embodiment 1 of the present invention, and the method includes the following steps:
[0040] Step 201: Extract multiple word regions from the image, and match any two word regions with the closest features to one or more candidate pairs;
[0041] It should be noted that the "single word" in the word area is a general term, including Arabic numerals and characters in various languages, for example, figure 1 "Promotion" and "sale" in "Promotion" shown in "Promotion" are respectively a single word, "2011" in "2011 Special" is a single word, and "Special" and "Price" are respectively a single word. In addition, the present invention does not limit the method for extracting all the word regions from the manually edited image, and any method in the prior art can be used for extraction.
[0042] After extracting the word area from the manually ...
Embodiment 2
[0078] In order to better distinguish between text areas and non-text areas, and improve the accuracy of line arrangement analysis, the second embodiment is compared with the first embodiment. The difference is that before selecting text lines from the candidate lines, further selecting Extract more features from the rows, and first delete one or more single-character regions that do not meet specific features from the candidate rows. See Figure 5 , Which is a method flowchart of another method for analyzing character line arrangement in character region recognition disclosed in the second embodiment of the present invention, including the following steps:
[0079] Step 501: Extract multiple word regions from the image, and match any two word regions with the closest features to one or more candidate pairs;
[0080] Step 502: among the matched candidate pairs, aggregate candidate pairs that meet preset aggregation conditions into a candidate group;
[0081] Step 503: Construct the ...
Embodiment 3
[0098] In order to further distinguish between text areas and non-text areas and improve the accuracy of line arrangement analysis, the third embodiment is compared with the second embodiment. The difference is that the top, bottom, height, and height are deleted from the rows to be selected. After the single word area whose width or spacing does not meet the corresponding preset threshold, the sum of the complexity of the single word area in the row to be selected is further calculated, and one or more rows to be selected are deleted according to the calculated complexity. See Figure 7 , Which is a method flow chart of another method for analyzing character line arrangement in character region recognition disclosed in Embodiment 3 of the present invention, including the following steps:
[0099] Step 701: Extract multiple word regions from the image, and match any two word regions with the closest features to one or more candidate pairs;
[0100] Step 702: among the matched candi...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com