Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text positioning method and system based on visual structure attribute

A technology of text positioning and structural attributes, applied in the field of image recognition, can solve the problems of improvement and false detection into text areas, etc.

Active Publication Date: 2015-03-25
SHENZHEN UNIV
View PDF4 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The advantage of the edge-based method is that the time complexity is low, which can meet the needs of real-time applications, and is less affected by the text color, but the false detection rate will increase when the background texture is complex, for example, there are some regular arrangements in the background. Objects are easily misdetected as text areas

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text positioning method and system based on visual structure attribute
  • Text positioning method and system based on visual structure attribute
  • Text positioning method and system based on visual structure attribute

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0088] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0089] The main purpose of the present invention is to extract texts from complex backgrounds, so as to do pre-processing for subsequent recognition, thereby improving the recognition rate. From the perspective of text visual attributes, the present invention detects abundant closed edges through color polarity difference transformation and edge-neighborhood end bonding to obtain sufficient candidate connected elements, and then through text stroke attribute features and text community attribute features Screening, extracting the connected components belonging to the text from the candidate connected components, and then positioning the final text through multi-channel fusion and removing repeated connected components. The text can be recognized by...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of image recognition, and particularly relates to a text positioning method and system based on the visual structure attribute. Based on the visual attribute of a text, by means of color polarity difference transformation and edge neighborhood tail end bonding, abundant closed edges are detected so that abundant candidate connection elements can be obtained, then character stroke attributive character and text colony attributive character screening is conducted, the connection elements belonging to characters are extracted from the candidate connection elements, and then the final text is positioned through multi-channel blending and repeated connection element removal. The method is high in robustness and can be adapted to the situation that multiple word language categories are mixed, or various font styles exist, or arrangement directions are random, or background interference exists and other situations, the positioned text can be directly provided for OCR software for recognition, and OCR software recognition rate can be increased. The text positioning method and system based on the visual structure attribute can be applied to image video retrieval, junk information blocking, vision assisted navigation, street view positioning, industrial equipment automation and other fields.

Description

technical field [0001] The invention belongs to the technical field of image recognition, and in particular relates to a text positioning method and system based on visual structure attributes. Background technique [0002] The semantic information contained in the text in images and videos is an important clue to understand the content of images and videos, and can be widely used in many fields such as image and video retrieval, auxiliary visual navigation, and industrial equipment automation. However, the OCR (Optical Character Recognition, Optical Character Recognition) recognition rate of text in complex backgrounds is not high, and a general and efficient text region location and text segmentation method is still a technical problem that needs to be solved urgently. [0003] For text images in complex backgrounds, the text must be positioned, text segmented, and perspective corrected before OCR recognition, in order to better complete the extraction of text information....

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/20
CPCG06V10/22
Inventor 王娜李霞翟芳冬
Owner SHENZHEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products