Identification method and device for Arabic character

An Arabic character recognition method, applied in the field of optical character recognition, that addresses problems such as character sticking, recognition errors, and a lowered recognition rate, and achieves the effect of reducing the character set and improving the recognition rate.

Active Publication Date: 2014-04-16
HANVON CORP
Cites: 4; cited by: 0

AI Technical Summary

Problems solved by technology

[0004] However, in actual use, typesetting and noise often cause a connected character segment to break into two connected character segments, or cause several connected character segments to join together. In these cases, the head of a connected segment may not be in the initial character form, and the tail may not be in the final character form. If the candidate set for recognition still assumes the initial form at the head and the final form at the tail, errors may occur, reducing the recognition rate.


Examples


Embodiment Construction

[0031] The invention proposes an Arabic character recognition method that introduces fuzzy character forms into the character recognition process. Recognizing against a single specified character form at a position where the actual form is uncertain can cause errors; recognizing against a fuzzy character form widens the range of candidates considered at that position and makes recognition more accurate.

[0032] Arabic characters have four basic character forms: the initial form (ini), the middle form (med), the final form (fin), and the independent form (iso). For characters whose specific form cannot be determined, the present invention uses fuzzy character forms: for example, the initial-middle form (inimed) indicates that the character may be in either the initial or the middle form, and the middle-final form (medfin) indicates that a character ...
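One way to picture a fuzzy character form is as a set of concrete forms that a position is allowed to take. The following is a minimal sketch, not the patent's implementation. The names `Form`, `FUZZY_FORMS`, `candidate_forms` and `candidate_templates` are hypothetical, and the expansion of `medfin` to {med, fin} is an assumption made by analogy with `inimed`, since the text defining it is cut off.

```python
from enum import Enum


class Form(Enum):
    """The four basic character forms named in the text."""
    INI = "ini"
    MED = "med"
    FIN = "fin"
    ISO = "iso"


# Each fuzzy form stands for a set of concrete forms.
FUZZY_FORMS = {
    # Stated in the text: initial or middle form.
    "inimed": frozenset({Form.INI, Form.MED}),
    # ASSUMPTION: the source definition is truncated; inferred from the name.
    "medfin": frozenset({Form.MED, Form.FIN}),
}


def candidate_forms(label: str) -> frozenset:
    """Return the concrete forms permitted by a basic or fuzzy form label."""
    if label in FUZZY_FORMS:
        return FUZZY_FORMS[label]
    return frozenset({Form(label)})  # a basic form permits only itself


def candidate_templates(label: str, templates: dict) -> dict:
    """Keep only templates whose form is permitted by the label.

    `templates` is a hypothetical mapping (character, Form) -> template data.
    """
    allowed = candidate_forms(label)
    return {key: value for key, value in templates.items() if key[1] in allowed}
```

Under this view, a block labelled with a basic form is matched against one form only, while a block labelled `inimed` is matched against both the initial-form and middle-form templates.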

Abstract

The invention discloses a recognition method and device for Arabic characters, and belongs to the field of optical character recognition. The method comprises the following steps: 1, estimating the baseline position and baseline height of an input line image; 2, searching for segmentation points in the input line image to obtain a block sequence, and recording the connected body to which each block belongs; 3, determining the character attribute of each block according to the position of the block within its connected body; 4, merging blocks in the block sequence and performing fuzzy recognition according to the character form of the merged blocks to obtain a recognition assessment; and 5, selecting the merging combination with the best overall recognition assessment and outputting it as the recognition result. Because fuzzy character forms are introduced into the character recognition process, the range of recognition is expanded and recognition is more accurate.
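Steps 4 and 5 can be sketched as a search over ways of grouping adjacent blocks into characters. The sketch below is illustrative only. The abstract does not say how the best combination is found, so the dynamic programming used here is a swapped-in technique, and the additive "higher is better" assessment, the `max_merge` limit, and the names `best_merging` and `score` are all assumptions.

```python
from typing import Callable, List, Tuple


def best_merging(
    num_blocks: int,
    score: Callable[[int, int], float],
    max_merge: int = 3,
) -> Tuple[float, List[Tuple[int, int]]]:
    """Choose the grouping of adjacent blocks with the highest total score.

    score(start, end) is a hypothetical recognizer assessment for treating
    blocks[start:end] as one character (ASSUMPTION: higher is better and the
    overall assessment is the sum over groups). At most `max_merge` blocks
    are merged into one character (also an assumption).
    """
    neg_inf = float("-inf")
    best = [neg_inf] * (num_blocks + 1)  # best[i]: best total for blocks[:i]
    back = [0] * (num_blocks + 1)        # back[i]: start of the last group
    best[0] = 0.0

    for end in range(1, num_blocks + 1):
        for start in range(max(0, end - max_merge), end):
            total = best[start] + score(start, end)
            if total > best[end]:
                best[end] = total
                back[end] = start

    # Walk the back-pointers to recover the chosen groups, left to right.
    spans: List[Tuple[int, int]] = []
    end = num_blocks
    while end > 0:
        spans.append((back[end], end))
        end = back[end]
    spans.reverse()
    return best[num_blocks], spans
```

In a fuller version, `score` would run the fuzzy recognition of step 4, using the character attribute from step 3 to decide which character forms are candidates for each merged group.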

Description

Technical field

[0001] The invention belongs to the field of optical character recognition, and relates to a recognition method and device, in particular to an Arabic character recognition method and device.

Background technique

[0002] Standard Arabic has 28 basic characters, and Uyghur has 32 basic characters. Depending on its position in the word, each Arabic character has 1-4 written forms: the independent form, the initial form, the middle form and the final form. In addition, text in the Arabic character set runs from right to left, and adjacent characters are joined to form one or several connected character segments. Within a segment, characters are connected along the baseline.

[0003] A general printed-Arabic recognition system, like a general OCR system, requires image preprocessing, line segmentation, character segmentation, single-character recognition and other processes. Due to the different character f...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06K9/62, G06K9/54
Inventor: 王琛, 刘正珍, 钮兴昱
Owner: HANVON CORP