Docx file text content extraction method and device
An extraction method and file technology, which is applied in the field of text content extraction of docx files, can solve problems such as execution speed that needs to be improved, and achieve the effect of increasing extraction speed and improving parsing speed
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0057]In order to make the above objects, features, and advantages of the present application, the following description will be described in further detail below with reference to the accompanying drawings and specific embodiments.
[0058]In order to understand the technical solutions provided herein, the background art according to the present application will be described.
[0059]The inventors found in the study of traditional parsing DOCX documents, common DOCX file resolutions were: based on Apache POI analysis methods based on COM interface analysis methods.
[0060]Based on the Apache PoI file parsing method, the DOCX file is read using the Java API provided by Apache Poi, obtaining the DOCX file text content, the method has the following deficiencies:
[0061]1, Apache Poi is the Java API, depending on the Java environment, there is a need to install JRE, in some special occasions (such as resource nervous) can not meet the needs;
[0062]2, the DOCX file itself is compressed format, and...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap