Method, device and equipment for identifying header and footer of electronic file and medium

A technology of electronic documents and identification methods, which is applied in the direction of electronic digital data processing, special data processing applications, instruments, etc., and can solve the problems of long time consumption, slow recognition speed of header and footer, high occupation of system resources, etc.

Pending Publication Date: 2021-02-05
北京方正印捷数码技术有限公司
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] This application provides a header and footer recognition method, device, equipment and medium for electronic documents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device and equipment for identifying header and footer of electronic file and medium
  • Method, device and equipment for identifying header and footer of electronic file and medium
  • Method, device and equipment for identifying header and footer of electronic file and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0101] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present application as recited in the appended claims.

[0102] At present, with the development of network technology, electronic files are more and more widely used in people's life. However, in the editing process of electronic documents, it is inevitable to undergo multiple revisions, or the electronic documents will be modified accordingly during the printing and typesetting process of electronic documents. Therefore, it is often necessa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a header and footer identification method and device for an electronic file, equipment and a medium. The method comprises steps of obtaining two to-be-analyzed files, and enabling one of the two files to be obtained based on the other one of the two files; removing the cross-page characters in each file in the file set for multiple times to obtain each residual character string of the cross-page characters in each file; determining each residual character string across pages in one file in the file set, each residual character string across pages in the other file in thefile set, and similarity between the residual character strings and the residual character strings; and determining the maximum similarity in the similarities, determining the removed characters corresponding to the maximum similarity in the cross-pages in each file, and repeating the steps to identify each cross-page for the header and footer of the cross-page in each file. By means of the method, the header and footer recognition speed can be increased, resources occupied by the system are saved, and recognition accuracy is improved.

Description

technical field [0001] The present application relates to electronic document processing technology, and in particular to a method, device, equipment and medium for identifying headers and footers of electronic documents. Background technique [0002] At present, the application of electronic files has been very extensive, and it is often necessary to analyze the file content of electronic files. However, the existence of headers and footers of electronic files will affect the results of file analysis, so it is usually necessary to analyze the headers and footers of electronic files. The identification then analyzes the remainder of the electronic file. [0003] In the prior art, when identifying the header and footer of an electronic file, because there is a difference between the character size of the header and footer and the size of the remaining characters in the electronic file, it can be determined according to the size of the smallest circumscribed rectangle of the c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/194G06F16/903
CPCG06F16/90344G06F40/194
Inventor 王雪峰林好谢浩
Owner 北京方正印捷数码技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products