Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Document positioning and cutting method in picture

A document and picture technology, applied in character and pattern recognition, instruments, computer parts, etc., can solve the problems of document skew, document occupation, and difficulty in keeping the mobile phone desktop, so as to eliminate interference and improve the recognition rate.

Active Publication Date: 2017-07-14
深圳市六六六国际旅行社有限公司
View PDF11 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the limitation of the shooting angle and viewfinder range, the document pictures taken directly by the user often have the following two problems: (1) The document often only occupies the center part of the picture, and there are a lot of invalid background interference around the document, which needs to be removed
(2) When the user is shooting, it is difficult to keep the mobile phone parallel to the desktop where the document is placed, causing the document to be tilted in the image, which needs to be corrected
[0004] In the existing software on the market, some of the four vertices of the document require the user to specify manually. This process requires user interaction, which is inefficient and not suitable for processing a large number of images.
There are also some software that automatically locate the four vertices of the document through image processing, but due to the limitations of the algorithm, misjudgments often occur, and the success rate is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document positioning and cutting method in picture
  • Document positioning and cutting method in picture
  • Document positioning and cutting method in picture

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0019] This embodiment provides a method for locating and splitting documents in pictures, such as figure 2 As shown, its method steps include:

[0020] step 1

[0021] The document image of the input picture, we first scale its size to a certain size, and when scaling, keep its long side at 1000 pixels, then convert the image from a color image to a grayscale image, and then extract the straight lines in the grayscale image line segment.

[0022] There are many existing methods for extracting straight line segments. We use the LSD fast line segment detection method in the system. This method is very fast and the effect of detecting straight lines is relatively stable. For example, Figure 3a As shown, the results of the straight line detection detected in the document image 21 include the document horizontal boundary line 22 and the docu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a document positioning and cutting method in a picture. The method comprises steps of S1: inputting a document image; S2: carrying out straight segment detection on the document image, and classifying the detected document segments into nearly horizontal straight segments, nearly vertical straight segments and straight segments in other directions; S3: extracting horizontal boundary segments and vertical boundary segments; S4: according to the extracted horizontal boundary segments and vertical boundary segments, determining positions of four top points of the document image; and S5: according to the positions of the four top points, carrying out cutting and righting on the document image. According to the invention, quite complex document types can be processed; top points can be quite precisely positioned in case of a quite complex background; the method is applied to software for document picture processing; after a user uses a mobile phone to shoot the document, the documents in the image can be quickly cut and righted; interference can be eliminated for following document identification modules; and identification rate of words in the documents is improved.

Description

technical field [0001] The invention relates to the technical field of computer software, in particular to a method for locating and cutting documents in pictures. Background technique [0002] In traditional office processes, the digitization of files or documents is usually done through scanners. With the popularization of smart phones and the improvement of the quality of mobile phone cameras, more and more consumers begin to use mobile phones to take pictures of documents or documents to obtain digital copies of documents. However, due to the limitation of the shooting angle and viewfinder range, the document pictures taken directly by the user often have the following two problems: (1) the document often only occupies the center part of the picture, and there are a lot of invalid background interference around the document, which needs to be cut off. (2) It is difficult for the user to keep the mobile phone parallel to the desktop where the document is placed when shoo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/32
CPCG06V30/412G06V10/24
Inventor 韩智素王珏刘新科谌波
Owner 深圳市六六六国际旅行社有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products