Method and device for recognizing text areas in image

A technology in text area and image, applied in character recognition, image enhancement, image analysis, etc., can solve the problem of incorrect order of text lines in the recognition result, and achieve the effect of ensuring the order of text lines

Active Publication Date: 2018-03-09
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF5 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Therefore, the existing character recognition technology has the problem that the sequence of characters in the recognition result is incorrect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for recognizing text areas in image
  • Method and device for recognizing text areas in image
  • Method and device for recognizing text areas in image

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

[0030] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0031]Please refer to figure 2 , which shows a flow 200 of an embodiment of the method for identifying text regions in an image according to the present application. The described method for identifying a text area in an image comprises the following steps:

[0032] Step ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The application discloses a method and a device for recognizing text areas in an image. A specific embodiment of the method comprises: acquiring a color value and position information of each pixelin the image to be recognized; clustering the pixels based on the color value of each pixel, wherein the color values of the pixels in each pixel category are the same or similar; for each pixel category after clustering, determining the contour of each connected area constituted by the pixels in the pixel category to obtain a contour set; and merging the contours based on the color value and position information of each contour in the contour set to obtain respective text areas in the image. The embodiment improves the accuracy of text line sequence recognition in image character recognition.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to the field of pattern recognition technology, and in particular to a method and device for recognizing text regions in an image. Background technique [0002] Optical Character Recognition (OCR), also known as text recognition, refers to the technology of recognizing characters in images. [0003] However, for images with mixed graphics and text, complex typesetting and various styles, the existing character recognition technology can only recognize character lines and characters when recognizing them, but cannot judge the order between character lines and character lines. Suppose the picture to be recognized is as figure 1 As shown, the general OCR system will sort the recognized text from top to bottom and from left to right, so figure 1 The sequence of text recognized in is "Title 1 Chapter 3 Chapter 2 Chapter 4", but actually in the typesetting of the original im...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/20G06K9/46G06V30/10
CPCG06V10/22G06V10/44G06V10/56G06V30/10G06V30/413G06V10/763G06F18/232G06T7/50G06T7/73G06T2207/10024G06T2207/20221
Inventor 陈鑫高建忠雷成军吴冬雪杨琳琳程涛远
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products