A PDF file category judgment method and a character extraction method
A technology of file category and determination method, applied in the field of content recognition, can solve the problems of inability to be used for secondary editing, automatic translation, inability to automatically determine the file category, and lack of file universality, etc., and achieve rapid positioning of file categories and text extraction High efficiency and improve the effect of automation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0038] All features disclosed in this specification, or steps in all methods or processes disclosed, may be combined in any manner, except for mutually exclusive features and / or steps.
[0039] Any feature disclosed in this specification (including any appended claims, abstract), unless otherwise stated, may be replaced by alternative features that are equivalent or serve a similar purpose. That is, unless expressly stated otherwise, each feature is one example only of a series of equivalent or similar features.
[0040] Refer to attached figure 1 , the present embodiment discloses a method for determining the category of a PDF file, which can determine whether the PDF file is an image file or a text file, and the determination includes the following steps:
[0041] A. Read the production program of the PDF file; according to the reading result, judge whether the PDF file is a picture or a non-picture, and if it is not a picture, proceed to the next step.
[0042] The produ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 
