A method and a system for realizing conversion from a Word document to a LaTeX document based on JAVA

A document conversion and document technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as difficulty, difficulty and complexity, document writing and typesetting, etc., to reduce difficulty and complexity, make up for Field gaps and the effect of improving work efficiency

Active Publication Date: 2019-06-21
北京安证通信息科技股份有限公司
View PDF8 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the process of implementing the present invention, the inventors found that the existing document conversion mainly has the following three types of problems in terms of technology and user use: First, the existing document conversion technology is generally aimed at a small number of source format documents and specific target format documents. Single function, for users, the actual use value is not high
Secondly, it is difficult to convert documents with different encoding methods, such as the conversion problem between Microsoft Office Word and LaTeX documents
Finally, the LaTeX document is composed of the markup language of the Tex language. To make a complete LaTeX document, it is necessary to master almost all the descriptive rules of the TeX language and the ability to write code. For non-professionals, document writing and typesetting have high difficulty and complexity

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and a system for realizing conversion from a Word document to a LaTeX document based on JAVA
  • A method and a system for realizing conversion from a Word document to a LaTeX document based on JAVA
  • A method and a system for realizing conversion from a Word document to a LaTeX document based on JAVA

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] In order to have a clearer understanding of the technical features, objects and effects of the present invention, the specific embodiments of the present invention will now be described in detail with reference to the accompanying drawings.

[0049] Please refer to figure 1 , it is the flow chart of Word document to LaTeX document conversion; A kind of method based on JAVA that the present invention proposes to realize is converted from Word document to LaTeX document, specifically comprises the following steps:

[0050] S1. According to the Word source document file submitted by the user, open the source document file through the Word calling program module in the JACOB component.

[0051] S2. In the opened source document file, the JACOB component is used to initially analyze various data elements in the source document file, and the data information of each data element in the source document file is acquired and recorded; the acquired and recorded data information i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and system for converting a Word document into a LaTeX document, and the method comprises the steps: submitting a Word document file according to a user, and carryingout the initial analysis of text, pictures, formulas, tables and other data in the file through employing a JACOB technology by the system; extracting Data elements in the source file through the Apache POI technology and the JACOB technology, and recording relative position information of all the elements; Classifying the extracted text elements according to a naive Bayes algorithm, and realizingconversion of a source file formula based on a cascading automatic encoder; Combining the relative position information with each data element to form information flow of the LaTeX target document; And writing the information flow into a target file so as to convert the information flow into a final LaTeX document. According to the method, the difficulty and complexity of converting the Word document into the LaTeX document can be reduced, a professional document conversion method is provided for college teachers, university students, college researchers and the like, and the working efficiency of document processing is improved.

Description

technical field [0001] The invention relates to the field of document conversion and data processing, and more particularly, to a method for realizing the conversion from Word document to LaTeX document based on JAVA. Background technique [0002] TeX provides a set of powerful and very flexible typesetting language, it has up to 900 instructions, and TeX has macro functions, users can continuously define their own applicable new commands to expand the functions of the TeX system. LaTeX, developed by Leslie Lamport, is the most popular and widely used set of TeX macros in the world today. As the core program of the Office suite, Microsoft Office Word provides many easy-to-use document creation tools and is currently the largest word processor on the market. The Word-specific file format, the Word file (.docx), has become the de facto most common document standard. Document conversion is to convert Word, Pdf, Txt, Ooxml, Odf, Html and other document formats. For example, t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/22
Inventor 宋军徐衡朱超群彭艳张坤曹威吴雅笛
Owner 北京安证通信息科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products