Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for converting LaTeX document into Word document

A document conversion and document technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of single conversion function and low practical use value, reduce difficulty and complexity, and improve scientific research work efficiency. Effect

Active Publication Date: 2019-08-20
CHINA UNIV OF GEOSCIENCES (WUHAN)
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a method and system for converting LaTeX documents to Word documents, which can realize different types of Convert between documents, reduce the difficulty of multi-document presentation, and improve the efficiency of document use

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for converting LaTeX document into Word document
  • Method and system for converting LaTeX document into Word document
  • Method and system for converting LaTeX document into Word document

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to have a clearer understanding of the technical features, objects and effects of the present invention, the specific embodiments of the present invention will now be described in detail with reference to the accompanying drawings.

[0056] A method for converting LaTeX documents to Word documents, such as figure 1 shown, including:

[0057] S1. The user submits the LaTeX source file to the system;

[0058] S2, the system opens the LaTeX source file;

[0059] S3. Perform initial analysis on the text, pictures, tables, and formula data elements in the source file through the JACOB component, obtain the category of each data element and the relative position information in the source document, and record the analyzed category and position parameters;

[0060] S4. Use Apache POI and JACOB technology to extract various data elements in the source file;

[0061] S5, use the Naive Bayes algorithm to classify and determine the extracted text elements to form a corr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and system for converting a LaTeX document into a Word document. The method comprises the following steps: performing initial analysis on data such as a text, a picture, a formula, a table and the like in a document by utilizing a JACOB technology; extracting data elements in the source file through the Apache POI technology and the JACOB technology, and recording relative position information of all the elements; classifying the extracted text elements according to a naive Bayes algorithm, and realizing conversion of a source file formula based on a cascading automatic encoder; combining the relative position information with each data element to form information flow of the Word target document; and writing the information flow into a target document so asto convert the information flow into a final Word document. According to the invention, the difficulty and complexity of converting the Late Office Word document into the Microsoft Office Word document can be reduced; and a user can conveniently convert a complex scientific and technical document format into a simple Word format, so that the scientific research work efficiency is improved, and the method and system for converting a LaTeX document into a Word document fill a gap in the field of intelligent conversion from a LaTeX document to a Microsoft Office document in China at present.

Description

technical field [0001] The invention relates to the field of document conversion and data processing, in particular to a method and system for converting a LaTeX document to a Word document. Background technique [0002] TeX provides a set of powerful and very flexible typesetting language, it has up to 900 instructions, and TeX has macro functions, users can continuously define their own applicable new commands to expand the functions of the TeX system. LaTeX, developed by Leslie Lamport, is the most popular and widely used set of TeX macros in the world today. As the core program of the Office suite, Microsoft Office Word provides many easy-to-use document creation tools and is currently the largest word processor on the market. The Word-specific file format, the Word file (.docx), has become the de facto most common document standard. Document conversion is to convert Word, Pdf, Txt, Ooxml, Odf, Html and other document formats. For example, the method of converting Oox...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/22
CPCG06F40/151
Inventor 宋军徐衡朱超群彭艳曹威张坤吴雅笛
Owner CHINA UNIV OF GEOSCIENCES (WUHAN)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products