Method and system for converting Word file into EPUB file

A file conversion technology applied to EPUB format files. It addresses the loss of directory structure, poor conversion quality, and garbled text seen in existing approaches, and it preserves the integrity of the document content.

Active Publication Date: 2019-08-02
PEKING UNIV

AI Technical Summary

Problems solved by technology

In particular, for Word files that contain a directory structure, whether the file carries navigation labels or has a directory page without jump links, prior-art conversion performs poorly: the directory structure is easily lost and the text becomes garbled.




Embodiment Construction

[0032] The present invention is further described below through embodiments, with reference to the accompanying drawings. The embodiments do not limit the scope of the invention in any way.

[0033] The present invention provides a method and system for converting a Word format file into an EPUB format file. For a Word file in .docx format, the method identifies and processes the directory (table of contents) of the source Word file, extracts its directory structure, and automatically generates an EPUB e-book. The method comprises five main steps: Word file analysis, XML file analysis, Word file splitting, HTML file generation, and EPUB file generation (see Figure 6).
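The XML analysis, splitting, and HTML generation steps can be sketched roughly as follows. This is a minimal illustration and not the patented procedure: it assumes chapter boundaries are marked by paragraphs whose style ID is `Heading1` (the ID differs in localized or custom templates), whereas the invention also handles directory pages without jump links. The function names `paragraphs`, `split_into_chapters`, and `chapter_to_xhtml` are hypothetical.

```python
import xml.etree.ElementTree as ET
from html import escape

# WordprocessingML namespace used by word/document.xml.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}


def paragraphs(document_xml):
    """Yield (style_id, text) for each w:p paragraph in the document XML."""
    root = ET.fromstring(document_xml)
    for p in root.iter(f"{{{W}}}p"):
        style = p.find("w:pPr/w:pStyle", NS)
        style_id = style.get(f"{{{W}}}val") if style is not None else None
        # A paragraph's text is spread over one or more w:t run elements.
        text = "".join(t.text or "" for t in p.iter(f"{{{W}}}t"))
        yield style_id, text


def split_into_chapters(document_xml, heading_style="Heading1"):
    """Start a new chapter at each paragraph styled with heading_style.

    Assumption: heading_style marks chapter titles. Text that appears
    before the first heading is kept in a "Front matter" chapter.
    """
    chapters = []
    for style_id, text in paragraphs(document_xml):
        if style_id == heading_style:
            chapters.append({"title": text, "body": []})
        elif text:
            if not chapters:
                chapters.append({"title": "Front matter", "body": []})
            chapters[-1]["body"].append(text)
    return chapters


def chapter_to_xhtml(chapter):
    """Render one chapter as a small XHTML document."""
    title = escape(chapter["title"])
    body = "\n".join(f"    <p>{escape(p)}</p>" for p in chapter["body"])
    return (
        '<?xml version="1.0" encoding="utf-8"?>\n'
        '<html xmlns="http://www.w3.org/1999/xhtml">\n'
        f"  <head><title>{title}</title></head>\n"
        f"  <body>\n    <h1>{title}</h1>\n{body}\n  </body>\n</html>\n"
    )
```

Splitting on a single heading level keeps the sketch short. A fuller version would probably carry the heading level along so that nested directory entries survive the conversion.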

[0034] The implementation of the present invention is illustrated below by converting a .docx file of "Six Chapters of a Floating Life" (hereinafter referred to as document 1). The specific steps are as follows:

[0035] 1) Obtain and decompress the Word file to be converted to obtain reso...
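Because a .docx file is a ZIP package, the decompression in step 1) can be sketched with Python's standard `zipfile` module. The helper names `read_docx_parts` and `main_document_xml` are hypothetical, and what the patent does with the extracted parts beyond this point is not shown in the text above.

```python
import io
import zipfile
from typing import Dict


def read_docx_parts(docx_bytes: bytes) -> Dict[str, bytes]:
    """Return every part of a .docx package keyed by its path inside the ZIP."""
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as package:
        return {name: package.read(name) for name in package.namelist()}


def main_document_xml(parts: Dict[str, bytes]) -> str:
    """Decode the main body part, which Word stores at word/document.xml."""
    return parts["word/document.xml"].decode("utf-8")
```

For a file on disk, something like `read_docx_parts(open(path, "rb").read())` would supply the bytes. Images and other embedded media usually appear under `word/media/` in the returned mapping.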



Abstract

The invention discloses a method and a system for converting a Word format file into an EPUB format file. The method targets Word files in .docx format. By identifying and processing the directory of the source Word file, the method recognizes the file's directory structure and automatically generates an EPUB e-book. It comprises the steps of Word file analysis, XML file analysis, Word file splitting, HTML file generation, and EPUB file generation. Because the method generates the EPUB e-book automatically while recognizing the directory of the source Word file, it solves prior-art problems such as poor conversion quality, the tedious manual addition of title directories, and low efficiency. The integrity of the document content is guaranteed, and both conversion quality and working efficiency are improved.
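The final EPUB file generation step can be illustrated with a small packaging sketch. It follows the general EPUB 3 container layout (an uncompressed `mimetype` entry first, `META-INF/container.xml`, a package document, and a navigation document), but it has not been validated with an EPUB checker and is not taken from the patent. The function `build_epub`, the `OEBPS/` folder name, and the default identifier and timestamp are assumptions made for illustration.

```python
import io
import zipfile
from html import escape

CONTAINER_XML = """<?xml version="1.0" encoding="utf-8"?>
<container version="1.0" xmlns="urn:oasis:names:tc:opendocument:xmlns:container">
  <rootfiles>
    <rootfile full-path="OEBPS/content.opf" media-type="application/oebps-package+xml"/>
  </rootfiles>
</container>
"""


def build_epub(title, chapters, identifier="urn:uuid:00000000-0000-0000-0000-000000000000",
               language="en", modified="2019-08-02T00:00:00Z"):
    """Package chapters into EPUB bytes.

    chapters: list of (file_name, heading, xhtml_text) tuples.
    identifier and modified are placeholder values for illustration only.
    """
    manifest = "\n".join(
        f'    <item id="c{i}" href="{escape(name)}" media-type="application/xhtml+xml"/>'
        for i, (name, _, _) in enumerate(chapters)
    )
    spine = "\n".join(f'    <itemref idref="c{i}"/>' for i in range(len(chapters)))
    toc = "\n".join(
        f'        <li><a href="{escape(name)}">{escape(heading)}</a></li>'
        for name, heading, _ in chapters
    )
    opf = f"""<?xml version="1.0" encoding="utf-8"?>
<package xmlns="http://www.idpf.org/2007/opf" version="3.0" unique-identifier="pub-id">
  <metadata xmlns:dc="http://purl.org/dc/elements/1.1/">
    <dc:identifier id="pub-id">{escape(identifier)}</dc:identifier>
    <dc:title>{escape(title)}</dc:title>
    <dc:language>{escape(language)}</dc:language>
    <meta property="dcterms:modified">{escape(modified)}</meta>
  </metadata>
  <manifest>
    <item id="nav" href="nav.xhtml" media-type="application/xhtml+xml" properties="nav"/>
{manifest}
  </manifest>
  <spine>
{spine}
  </spine>
</package>
"""
    nav = f"""<?xml version="1.0" encoding="utf-8"?>
<html xmlns="http://www.w3.org/1999/xhtml" xmlns:epub="http://www.idpf.org/2007/ops">
  <head><title>{escape(title)}</title></head>
  <body>
    <nav epub:type="toc">
      <ol>
{toc}
      </ol>
    </nav>
  </body>
</html>
"""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as epub:
        # The mimetype entry must come first and must be stored uncompressed.
        epub.writestr("mimetype", "application/epub+zip", compress_type=zipfile.ZIP_STORED)
        epub.writestr("META-INF/container.xml", CONTAINER_XML)
        epub.writestr("OEBPS/content.opf", opf)
        epub.writestr("OEBPS/nav.xhtml", nav)
        for name, _, xhtml in chapters:
            epub.writestr(f"OEBPS/{name}", xhtml)
    return buf.getvalue()
```

The navigation document is where the recovered directory structure ends up: each entry links to one of the split chapter files, so the e-book keeps a working table of contents.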

Description

Technical field

[0001] The present invention relates to document processing technology, in particular to a method and system for converting Word format files into EPUB (Electronic Publication) format files.

Background

[0002] In the era of digital publishing and "Internet +", with the rapid development of mobile communication and online publishing, e-books are becoming increasingly popular. The digital age has changed people's reading habits: fragmented reading and mobile reading on e-readers, smartphones, and other devices have become widely accepted and preferred. However, because of differences in devices, platforms, and publishing media, many e-book formats have emerged on the market, such as TXT, PDF, EPUB, Mobi, Azw3, CEB / CEBX, CAJ, PDG, and many more. Among the popular e-book formats, EPUB, as the official standard of the International Digital Publishing Forum (IDPF), is listed as the th...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/22
CPC: G06F40/14, G06F40/151
Inventor: 高良才, 陈嘉云, 汤帜
Owner: PEKING UNIV