Method and system for converting Word document into LaTeX document

A document conversion and document technology, which is applied in the field of Word document to LaTeX document conversion, can solve the problems of low actual use value and single conversion function, and achieve the effect of filling the gap in the field, improving work efficiency, and reducing difficulty and complexity

Active Publication Date: 2019-08-20
北京东青数科技有限公司
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a method and system for converting a Word document to a LaTeX document, which can realize different types of documents The conversion between documents reduces the difficulty of multi-document presentation and improves the efficiency of document use

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for converting Word document into LaTeX document
  • Method and system for converting Word document into LaTeX document
  • Method and system for converting Word document into LaTeX document

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] In order to have a clearer understanding of the technical features, purposes and effects of the present invention, the specific implementation of the present invention will now be described in detail with reference to the accompanying drawings.

[0058] A method for converting a Word document to a LaTeX document, which is applied to an application program in a computer device, and the application program is started after responding to an artificial trigger instruction, such as figure 1 shown, including:

[0059] S1. The user submits the Microsoft Office Word source file to the system;

[0060] S2, the system opens the Microsoft Office Word source file;

[0061] S3. Perform an initial analysis of data elements such as text, pictures, tables, and formulas in the source file through the JACOB component, obtain the category of each data element and the relative position information in the source document, and record the analyzed category and position parameters ;

[0062...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and system for converting a Word document into a LaTeX document, being characterized in that a user submits a Microsoft Office Word document, and the system utilizes aJACOB technology to carry out initial analysis on text, pictures, formulas, tables and other data in the document; data elements in the source file are extracted through the Apache POI technology andthe JACOB technology, and relative position information of all the elements is recorded; the extracted text elements are classified according to a naive Bayes algorithm, and conversion of a source file formula based on a cascading automatic encoder is realized; the relative position information is combined with each data element to form information flow of the LaTeX target document; and the information flow is written into a target document so as to convert the information flow into a final LaTeX document. According to the method and system for converting a Word document into a LaTeX document,the difficulty and complexity of converting the Microsoft Office Word document into the LaTeX document can be reduced; a professional document conversion method is provided for college teachers, university students, college researchers and the like; and the working efficiency of document processing is improved.

Description

technical field [0001] The invention relates to the fields of document conversion and data processing, in particular to a method and system for converting a Word document to a LaTeX document. Background technique [0002] TeX provides a powerful and flexible typesetting language with up to 900 instructions, and TeX has a macro function, users can continuously define their own applicable new commands to expand the functions of the TeX system. LaTeX developed by Leslie Lamport is the most popular and widely used TeX macro set in the world today. As the core program of the Office suite, Microsoft Office Word provides many easy-to-use document creation tools, and it is also the word processor with the largest share in the market. The Word file (.docx), which is a Word-specific file format, has become the de facto most common document standard. Document conversion is to convert Word, Pdf, Txt, Ooxml, Odf, Html and other document formats. For example, the method that the docume...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/21G06F16/35
CPCG06F40/103
Inventor 宋军徐衡朱超群彭艳张坤曹威吴雅笛
Owner 北京东青数科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products