Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

File analysis method and device, electronic equipment and storage medium

A file parsing and file technology, applied in the field of file parsing, can solve the problems of incompatible file format parsing efficiency, incomplete parsing of nested files, etc., and achieve the effect of avoiding parsing omissions and parsing content comprehensively

Pending Publication Date: 2022-04-26
合肥闪捷信息科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the existing Microsoft Windows API interface, NET NPOI library or Java POI class library and other platforms parse files, there are usually problems such as incomplete parsing of nested files, incompatible file formats, and poor parsing efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • File analysis method and device, electronic equipment and storage medium
  • File analysis method and device, electronic equipment and storage medium
  • File analysis method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0070] A cross-platform Office nested file content parsing method is provided, which is executed through the parsing engine, and the method includes:

[0071] S1: The monitoring system Hook leaked office documents;

[0072] S2: The parsing engine loads the file to be parsed;

[0073] S3: The parsing engine parses the file type. If it is ODF OOXML type, go to S4-1; if it is ODFXML type, go to S4-2; if it is OLE type, go to S4-3; if it is XLSB type, go to S4-4; if it is RTF type, go to S4-5 ;

[0074] S4-1: Parse the ODF OOXML file directory;

[0075] S4-2: Parse the ODFXML file directory;

[0076] S4-3: Parse the OLE file directory;

[0077] S4-4: Parse the XLSB file directory;

[0078] S4-5: Parse the RTF file directory;

[0079] S5: Analyze the file attribute information according to the file directory;

[0080] S6: According to the file directory, parse the content of the file body;

[0081] S7: judge whether the current file contains a nested file, if it contains a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a file analysis method and device, electronic equipment and a storage medium, and the method comprises the steps: (A) determining a target Office file which is an Office file operated on terminal equipment in real time; (B) determining a target file type of the to-be-analyzed Office file of the current layer; (C) obtaining a file directory of the Office file to be analyzed according to a file directory analysis logic corresponding to the target file type; (D) according to the file directory, outputting file attribute information and file content of the Office file to be analyzed; (E) according to the file directory, determining whether a nested file exists in the Office file to be analyzed; and (F) if the nested file exists in the to-be-analyzed Office file, obtaining the nested file, and taking the nested file as the to-be-analyzed Office file, so as to return to execute the step (B).

Description

technical field [0001] The present application relates to the technical field of file analysis, and in particular, to a file analysis method, device, electronic equipment, and storage medium. Background technique [0002] The mainstream Office nested file suite providers on the market include Microsoft Office, Kingsoft WPSOffice, Yongzhong Office, etc. There are also some open source software such as Open Office and Libre Office. The above file format types can be classified into three categories: OLE, ODF, and OOXML, and the file formats of different versions are also different. When existing platforms such as Microsoft Windows API interface, NET NPOI library or Java POI class library parse files, there are usually problems such as incomplete parsing of nested files, incompatible file formats, and poor parsing efficiency. Therefore, a cross-platform Office nested file content analysis method with strong compatibility and comprehensive analysis content is needed. Contents...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/11G06F16/16G06F16/178G06F40/205
CPCG06F16/11G06F16/16G06F16/1794G06F40/205
Inventor 张黎吴洋张承伟陈广辉刘维炜
Owner 合肥闪捷信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products