Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method and system for converting latex documents to word documents

A document conversion and document technology, applied in the direction of instruments, calculations, electrical digital data processing, etc., can solve the problems of single conversion function and low actual use value, reduce difficulty and complexity, and improve the efficiency of scientific research work

Active Publication Date: 2020-11-27
CHINA UNIV OF GEOSCIENCES (WUHAN)
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a method and system for converting LaTeX documents to Word documents, which can realize different types of Convert between documents, reduce the difficulty of multi-document presentation, and improve the efficiency of document use

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and system for converting latex documents to word documents
  • A method and system for converting latex documents to word documents
  • A method and system for converting latex documents to word documents

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to have a clearer understanding of the technical features, purposes and effects of the present invention, the specific implementation manners of the present invention will now be described in detail with reference to the accompanying drawings.

[0056] A method for converting LaTeX documents to Word documents, such as figure 1 shown, including:

[0057] S1. The user submits the LaTeX source file to the system;

[0058] S2. The system opens the LaTeX source file;

[0059] S3. Initially analyze the text, pictures, tables, and formula data elements in the source file through the JACOB component, obtain the category of each data element and the relative position information in the source document, and record the analyzed category and position parameters;

[0060] S4, using Apache POI and JACOB technology to extract various data elements in the source file;

[0061] S5. Use the naive Bayesian algorithm to classify and judge the extracted text elements to form the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and system for converting LaTeX documents to Word documents, using JACOB technology to initially analyze data such as text, pictures, formulas, and tables in the file; using Apache POI and JACOB technology to extract data elements in the source file , and record the relative position information of each element; classify the extracted text elements according to the naive Bayesian algorithm, and convert the source document formula based on the cascaded autoencoder; combine the relative position information with each data element, Form the information flow of the Word target document; write the above information flow into the target file, thereby converting it into the final Word document. The invention can reduce the difficulty and complexity of converting from Latex documents to Microsoft Office Word documents, facilitate users to convert complex scientific and technological document formats into simple Word formats, and improve the efficiency of scientific research work. This invention fills the current domestic LaTeX document format Field blank for intelligent conversion to Microsoft Office Word documents.

Description

technical field [0001] The invention relates to the fields of document conversion and data processing, in particular to a method and system for converting a Latex document into a Word document. Background technique [0002] TeX provides a powerful and flexible typesetting language with up to 900 instructions, and TeX has a macro function, users can continuously define their own applicable new commands to expand the functions of the TeX system. LaTeX developed by Leslie Lamport is the most popular and widely used TeX macro set in the world today. As the core program of the Office suite, Microsoft Office Word provides many easy-to-use document creation tools, and is currently the largest word processor in the market. The Word file (.docx), which is a Word-specific file format, has become the de facto most common document standard. Document conversion is to convert Word, Pdf, Txt, Ooxml, Odf, Html and other document formats. For example, the method of converting Ooxml and Od...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/151
CPCG06F40/151
Inventor 宋军徐衡朱超群彭艳曹威张坤吴雅笛
Owner CHINA UNIV OF GEOSCIENCES (WUHAN)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products