Classification and Information Extraction Method of Formatted Fax Based on OCR
An information extraction and fax technology, applied in the field of image processing, can solve the problems of inability to realize fax image classification and information extraction, inability to scan faxes, and difficulty in extracting key information, so as to improve office work efficiency, high accuracy of information extraction, The effect of fast classification
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0044] The following is based on figure 1 The specific embodiment of the present invention is further described:
[0045] see figure 1 , this embodiment is applicable to any formatted fax, wherein the formatted fax is an image fax with a form. This embodiment takes the fax of a bill as an example, and the details are as follows:
[0046] A method for classifying and extracting information of formatted faxes based on OCR, specifically comprising the following steps:
[0047] Step 1: Obtain the faxed image file of the bill, perform adaptive threshold binarization on the image, and reduce noise interference;
[0048] Step 2: Determine the inclination angle of the image, and correct the image;
[0049] Step 3: Find the outline of the largest bounding box of the table in the corrected image, and intercept the banknote area of the image from the upper area of the largest bounding box of the table in the image;
[0050] Step 4: filter the font outlines in the header area and ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 
