
Segmentation Method of Printed Uyghur Documents Based on Morphology and Integral Projection

A segmentation technology based on morphology and integral projection, applied in the fields of instruments, computing, and character and pattern recognition. It addresses the missed character segmentation and limited flexibility of existing methods, so that it can be used more widely and improves segmentation accuracy.

Active Publication Date: 2019-03-08
XIDIAN UNIV
Cites: 4 · Cited by: 0

AI Technical Summary

Problems solved by technology

Although this method can segment the line document images in an entire Uyghur document image, it has two disadvantages. First, it sets a threshold in the line segmentation step to distinguish line spacing from intra-line spacing, which limits its flexibility. Second, character segmentation suffers from both over-segmentation and missed segmentation: characters of one shape [glyph image missing] are over-segmented, and vertically overlapping characters of another shape [glyph image missing] are missed during splitting.
Although this method avoids missed segmentation when characters overlap vertically, its disadvantage is that it still causes missed segmentation for characters of a certain shape [glyph image missing].


Examples


Embodiment Construction

[0069] The present invention will be further described below in conjunction with the accompanying drawings.

[0070] The specific steps of the present invention are described below with reference to Figure 1.

[0071] Step 1, input a binary image.

[0072] Input a binary image of a printed Uyghur document, 2362×3327 pixels (width × height), that is noise-free and not skewed.

[0073] Step 2, obtain the line document images.

[0074] Using a morphological dilation algorithm, the input binary image is dilated to obtain a dilated image in which the characters belonging to the same document line of the printed Uyghur document image overlap one another.
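The dilation in paragraph [0074] can be sketched as follows. This is only an illustrative sketch: the source does not state the shape or size of the structuring element, so the wide rectangular element, the function name `dilate`, and the parameters `half_h` and `half_w` are assumptions, as is the convention that foreground (ink) pixels are 1.

```python
def dilate(image, half_h=0, half_w=2):
    """Binary dilation with a (2*half_h+1) x (2*half_w+1) rectangle.

    The rectangle size is an assumption; the source does not specify it.
    """
    rows, cols = len(image), len(image[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if image[r][c]:
                # Stamp the structuring element around every foreground pixel.
                for rr in range(max(0, r - half_h), min(rows, r + half_h + 1)):
                    for cc in range(max(0, c - half_w), min(cols, c + half_w + 1)):
                        out[rr][cc] = 1
    return out


# Toy page: two text lines (rows 1 and 3) with gaps between "characters".
page = [
    [0, 0, 0, 0, 0, 0, 0],
    [1, 0, 0, 1, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 0, 0, 0, 1, 0],
]
dilated = dilate(page, half_h=0, half_w=2)
for row in dilated:
    print(row)
```

A wide, flat element is chosen here so that characters merge horizontally within a line while the blank rows between lines stay empty; a real page would need an element sized to its character and line spacing.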

[0075] A four-neighborhood seed-fill connected-component algorithm is used to extract each connected component of the dilated image.
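A minimal sketch of four-neighborhood seed filling, assuming the dilated image is a list of rows with foreground pixels equal to 1. The function name `connected_components` and the breadth-first queue are illustrative choices, not details taken from the patent.

```python
from collections import deque


def connected_components(image):
    """Return a list of components, each a list of (row, col) pixels.

    Pixels are joined only through their up/down/left/right neighbours.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r0 in range(rows):
        for c0 in range(cols):
            if not image[r0][c0] or seen[r0][c0]:
                continue
            # New seed: flood-fill everything reachable from it.
            seen[r0][c0] = True
            queue = deque([(r0, c0)])
            pixels = []
            while queue:
                r, c = queue.popleft()
                pixels.append((r, c))
                for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    nr, nc = r + dr, c + dc
                    if (0 <= nr < rows and 0 <= nc < cols
                            and image[nr][nc] and not seen[nr][nc]):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            components.append(pixels)
    return components


dilated = [
    [0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 1, 1, 1],
]
components = connected_components(dilated)
print([len(p) for p in components])  # [5, 2, 3]
```

The last two rows touch only diagonally, so four-neighborhood filling keeps them as separate components; an eight-neighborhood variant would merge them.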

[0076] The upper side of the bounding rectangle of each connected component is used as the upper boundary of each line document image, and the lower side as the lower boundary of each line document ...
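The visible part of paragraph [0076] can be illustrated with a short sketch that takes the top and bottom rows of each component's bounding rectangle and crops the corresponding rows from the original binary image. The paragraph is truncated in this text, so anything beyond the upper and lower boundaries is not covered; the helper names `line_bounds` and `crop_lines`, and the choice to keep the full page width, are assumptions.

```python
def line_bounds(components):
    """Top and bottom row of each component's bounding rectangle, top to bottom."""
    bounds = []
    for pixels in components:
        rows = [r for r, _ in pixels]
        bounds.append((min(rows), max(rows)))
    return sorted(bounds)


def crop_lines(image, bounds):
    """Cut one line image per (top, bottom) pair, keeping the full width (assumption)."""
    return [image[top:bottom + 1] for top, bottom in bounds]


# Original (undilated) binary page and the components found on its dilated copy.
page = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 1, 0],
    [0, 0, 0],
    [1, 0, 0],
    [1, 0, 1],
    [0, 0, 0],
]
components = [
    [(1, 0), (1, 1), (2, 1)],
    [(4, 0), (5, 0), (5, 2)],
]
bounds = line_bounds(components)
lines = crop_lines(page, bounds)
print(bounds)  # [(1, 2), (4, 5)]
```

Because the boundaries come from the components themselves, no fixed line-spacing threshold is needed in this sketch.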


Abstract

The invention discloses a method for segmenting printed Uyghur documents based on morphology and integral projection. It mainly solves two problems of existing segmentation methods: limited flexibility when acquiring line document images, and missed segmentation of characters of a certain shape [glyph image missing] when acquiring single-character images. The method improves the segmentation accuracy of printed Uyghur documents.

Description

Technical field

[0001] The invention belongs to the field of character segmentation in optical character classification, and further relates to a method for segmenting printed Uyghur documents based on morphology and integral projection within that field. The invention can be used to segment a scanned image of a paper Uyghur document into individual Uyghur character images, as preparatory work for segmentation-based printed Uyghur document recognition.

Background technique

[0002] At present, segmentation-based recognition of printed Uyghur documents is widely used. Accurately segmenting Uyghur characters from Uyghur document images is therefore the premise and basis of printed Uyghur document recognition. However, because Uyghur borrows the written form of the Arabic and Persian alphabets, it is a connected phonetic script, and its shape is similar to Chinese cursive sc...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06K9/34
CPC: G06V30/153, G06V30/293
Inventor: 卢朝阳, 王小弟, 李静, 郎潇, 艾合买提·阿卜力皮孜
Owner: XIDIAN UNIV