A method for detecting and correcting garbled characters in PDF documents
A technique for handling garbled characters in PDF documents, applied in the field of garbled character detection and correction. It addresses problems such as the time consumed by detection and improves the efficiency of garbled character detection.
Embodiment Construction
[0020] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the embodiments and accompanying drawings.
[0021] As shown in Figure 1, the method flow for detecting and correcting garbled characters in a PDF document includes:
[0022] Extract all font features in PDF documents;
[0023] According to the font features, divide the fonts into normal fonts, garbled fonts and undetermined fonts;
[0024] Extract the dot matrix image of each character in the undetermined fonts, calculate the similarity between the dot matrix image and the corresponding character code using a garbled character detection algorithm based on image statistical features, and judge from the similarity whether each character in the undetermined fonts is normal or garbled;
[0025] Carrying out vertical and horizontal editing and correction to the garbled characters in the u...
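The classification and similarity steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation described in the patent. The text does not say which font features are extracted, which statistical features are computed, or what threshold is used, so the fields of `FontFeatures`, the rules in `classify_font`, the row and column projection features, the cosine similarity, and the threshold of 0.9 are all hypothetical placeholders. The reference bitmap is assumed to be a rendering of the character that the code is supposed to represent, obtained from a trusted font.

```python
from dataclasses import dataclass
from enum import Enum
from math import sqrt


class FontClass(Enum):
    NORMAL = "normal"
    GARBLED = "garbled"
    UNDETERMINED = "undetermined"


@dataclass
class FontFeatures:
    # Hypothetical features; the source does not list the ones it uses.
    embedded: bool
    has_unicode_map: bool
    standard_encoding: bool


def classify_font(f: FontFeatures) -> FontClass:
    """Placeholder rules that split fonts into three classes."""
    if f.standard_encoding and f.has_unicode_map:
        return FontClass.NORMAL
    if f.embedded and not f.has_unicode_map and not f.standard_encoding:
        return FontClass.GARBLED
    return FontClass.UNDETERMINED


def projection_features(bitmap):
    """Row and column ink counts of a 0/1 bitmap, normalized by total ink."""
    rows = [sum(r) for r in bitmap]
    cols = [sum(c) for c in zip(*bitmap)]
    total = sum(rows) or 1  # avoid division by zero for a blank bitmap
    return [v / total for v in rows + cols]


def similarity(glyph, reference):
    """Cosine similarity between the projection features of two bitmaps."""
    fa, fb = projection_features(glyph), projection_features(reference)
    dot = sum(x * y for x, y in zip(fa, fb))
    na = sqrt(sum(x * x for x in fa))
    nb = sqrt(sum(y * y for y in fb))
    return dot / (na * nb) if na and nb else 0.0


def is_garbled(glyph, reference, threshold=0.9):
    """Assumed decision rule: low similarity means the character is garbled."""
    return similarity(glyph, reference) < threshold


# Two same-sized toy bitmaps: a vertical bar and a horizontal bar.
VERTICAL = [[0, 1, 0], [0, 1, 0], [0, 1, 0]]
HORIZONTAL = [[0, 0, 0], [1, 1, 1], [0, 0, 0]]

font = FontFeatures(embedded=True, has_unicode_map=True, standard_encoding=False)
if classify_font(font) is FontClass.UNDETERMINED:
    # Only characters of undetermined fonts need the image comparison.
    print(is_garbled(VERTICAL, HORIZONTAL))
    print(is_garbled(VERTICAL, VERTICAL))
```

Restricting the image comparison to undetermined fonts is what the three-way split makes possible: fonts already judged normal or garbled from their features skip the per-character bitmap check, which is in line with the stated goal of reducing detection time.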
