Credential layout analysis method and device

A format and certificate technology, applied in the field of certificate format analysis, can solve the problems of cumbersome development process, uncertain results, and large workload, and achieve the effect of avoiding repeated development, reducing workload, and making small changes.

Active Publication Date: 2016-12-07
CHONGQING ZHONGKE YUNCONG TECH CO LTD
View PDF5 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, if this method is used, whenever there is a new certificate format, it is necessary to extract the layout features of the format and set the conditions for format judgment. This is equivalent to a new development process, which must be continuously tested, iterated, The entire development process is cumbersome, the workload is heavy, and the result is uncertain

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Credential layout analysis method and device
  • Credential layout analysis method and device
  • Credential layout analysis method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0044] see figure 1 , the present invention provides a flow chart of a method for certificate format analysis, including:

[0045] Step S1, obtaining the certificate image;

[0046] Specifically, the certificate image can be taken by a terminal device connected to a camera or a terminal device with a built-in camera, or it can be an image intercepted by analyzing a video stream or a directly stored certificate image; the terminal device can be, for example, a mobile phone or a tablet computer , PDA (Personal Digital Assistant, personal digital assistant, referred to as: PDA), etc.

[0047] Step S2, extracting the format features in the document image;

[0048] Specifically, extracting the typography feature is composed of the character gradient direction histogram feature, the inter-line distribution feature and the intra-line character inter-character feature.

[0049] Step S3, using a document recognition model to identify each of the format features, and obtain the corre...

Embodiment 2

[0055] Such as figure 2 As shown, the training flowchart of the certificate recognition model in the method for certificate format analysis provided by the present invention includes:

[0056] Step S101, collecting certificate images of different formats among similar certificates;

[0057]Among them, if the document to be analyzed is an ID card, then the document images of different versions of the ID card need to be collected; if the document to be analyzed is a passport, then the document images of different versions of the passport need to be collected; if the document to be analyzed is Bank bills, then different versions of bank bill images need to be collected; according to the different types of documents to be analyzed, different versions of the document images are selected.

[0058] Step S102, extracting all the layouts in each document image and the layout features corresponding to each layout,

[0059] Wherein, each certificate image contains multiple text lines,...

Embodiment 3

[0066] Such as image 3 As shown, the flow chart of step S2 in the method for document format analysis provided by the present invention includes:

[0067] Step S201, performing binary segmentation on the certificate image to obtain corresponding text lines;

[0068] Among them, the purpose of adopting the principle of binarization segmentation is to process the key points in the document image, remove the background by the way when segmenting the image, leave the target object of interest, and facilitate the extraction of text lines; the method of binarization segmentation specifically includes the following Three types, threshold based on pixel value, threshold based on region property or threshold based on coordinate position.

[0069] Step S202, sequentially selecting different character lines to combine to generate multiple layouts, wherein each combination is a layout;

[0070] Wherein, each text line obtained by segmentation is combined to generate a plurality of layo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a credential layout analysis method and device. The method comprises a step of obtaining a credential image, a step of extracting the layout characteristic in the credential image, a step of using a credential identification model to identify each layout characteristic, obtaining a correlation degree level corresponding to the layout characteristic, wherein the credential identification model is obtained after training a training sample set, and a step of screening the correct layout of the credential image with a highest correlation degree level corresponding to the layout characteristic. Through constructing a universal multiple-layout analysis frame, the same type of credentials with different layouts can be identified, even a new layout appears, only the preparation of corresponding credential image data is needed, training is carried out again and the model is updated, the change of an original frame is small, the expansion and integration can be rapidly carried out, thus the repeated development is avoided, the workload of the development is reduced, the development process and result are controllable, the OCR identification of the image is facilitated, and the identification efficiency is improved.

Description

technical field [0001] The present invention relates to the technical field of image processing, in particular to a method and device for document format analysis. Background technique [0002] With the development of information technology, there are more and more applications of non-contact authentication based on the network, and the remote identity authentication technology emerges as the times require. Information technology has also been popularized and widely used. This solution has the advantages of low cost, convenient integration, and easy expansion. More and more manufacturers have also launched their own ID photo recognition systems. [0003] At present, ID photo recognition generally includes the following processes: 1. Perform rotation and tilt correction on the ID image; 2. Image denoising, image enhancement and other preprocessing; 3. Layout analysis, information column positioning; 4. Line segmentation and character segmentation; 5. Character recognition; ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/46
CPCG06V10/507G06V10/422
Inventor 周曦周亚飞
Owner CHONGQING ZHONGKE YUNCONG TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products