Document format conversion system and method

A document format conversion technology, applied in the computer field, that addresses problems such as documents that cannot be shared or discussed, limited functionality, and the inability to integrate audio/video resources.

Active Publication Date: 2010-04-28
科大讯飞(上海)科技有限公司

AI Technical Summary

Problems solved by technology

Existing approaches share the disadvantages of desktop document readers: documents cannot be shared or discussed.
[0005] Current online PPT features generally use the Flash format, which offers limited functionality: no animation effects, no interactivity, and no way to integrate external audio/video resources.


Examples

Embodiment 1

[0057] Referring to Figure 1, the present invention discloses a document format conversion system 10, which includes an image conversion module 11, a text acquisition module 12, a text image mapping module 13, and a PPT-FLASH conversion module 14.

[0058] The image conversion module 11, the text acquisition module 12, and the text image mapping module 13 are used to convert the document into an image format, and obtain the text corresponding to each position of the image.

[0059] The image conversion module 11 converts each page of the document into image-format data. The text acquisition module 12 obtains the text of each page of the document, together with the state information of each character in the image. The text image mapping module 13 generates a mapping table that relates the text on each page to the page image; the table contains the state information of each character in the image.
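
The mapping table described in [0059] can be pictured with a minimal Python sketch. The source does not say what the state information contains, so the fields below (a character plus a pixel bounding box) are an assumption, and the names `CharState`, `PageMapping`, and `build_mapping_table` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class CharState:
    """State of one character in the page image (assumed: a pixel bounding box)."""
    char: str
    x: int
    y: int
    width: int
    height: int


@dataclass
class PageMapping:
    """Mapping-table entry for one page: the page image plus its characters."""
    page_number: int
    image_file: str
    chars: List[CharState] = field(default_factory=list)

    def text(self) -> str:
        # Plain text of the page, in the order the characters were recorded.
        return "".join(c.char for c in self.chars)

    def char_at(self, px: int, py: int) -> Optional[CharState]:
        # Return the character whose bounding box contains the pixel, if any.
        for c in self.chars:
            if c.x <= px < c.x + c.width and c.y <= py < c.y + c.height:
                return c
        return None


def build_mapping_table(
    pages: Sequence[Tuple[str, Sequence[Tuple[str, int, int, int, int]]]]
) -> Dict[int, PageMapping]:
    """Build a table keyed by 1-based page number from (image_file, characters) pairs."""
    table: Dict[int, PageMapping] = {}
    for number, (image_file, raw_chars) in enumerate(pages, start=1):
        chars = [CharState(*raw) for raw in raw_chars]
        table[number] = PageMapping(number, image_file, chars)
    return table


table = build_mapping_table(
    [("page1.png", [("H", 0, 0, 10, 20), ("i", 10, 0, 5, 20)])]
)
```

With a table like this, a viewer that only displays the page image could still resolve a click or selection back to the underlying text, which is the role the mapping table plays in the system described above.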

[0060] The image conversion module 11, text ac...

Embodiment 2

[0115] One of the improvements of the present invention lies in how documents are converted. Referring to Figure 6, the conversion rules of the present invention are as follows:

[0116] Plain text information of text data → plain text data (.txt format);

[0117] Font information, text effects and image data → image data (.png format);

[0118] Correspondence between text data and image data → XML data (.xml format);

[0119] Multimedia data → Adobe Flash (.swf format);

[0120] Script data → discarded (for security reasons).
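
The conversion rules in [0116] to [0120] amount to a lookup from the kind of source data to an output format. The following Python sketch shows one way to express that lookup; the enum `DataKind`, the table `OUTPUT_EXTENSION`, and the function `output_name` are hypothetical names introduced for illustration, not part of the patent.

```python
from enum import Enum
from typing import Dict, Optional


class DataKind(Enum):
    """Kinds of source data named in the conversion rules."""
    PLAIN_TEXT = "plain text information of text data"
    APPEARANCE = "font information, text effects and image data"
    TEXT_IMAGE_MAPPING = "correspondence between text data and image data"
    MULTIMEDIA = "multimedia data"
    SCRIPT = "script data"


# Output file extension per data kind; None means the data is discarded.
OUTPUT_EXTENSION: Dict[DataKind, Optional[str]] = {
    DataKind.PLAIN_TEXT: ".txt",
    DataKind.APPEARANCE: ".png",
    DataKind.TEXT_IMAGE_MAPPING: ".xml",
    DataKind.MULTIMEDIA: ".swf",
    DataKind.SCRIPT: None,  # scripts are dropped for security reasons
}


def output_name(base: str, kind: DataKind) -> Optional[str]:
    """Return the output file name for a data kind, or None if it is discarded."""
    extension = OUTPUT_EXTENSION[kind]
    return None if extension is None else base + extension
```

For example, `output_name("page1", DataKind.PLAIN_TEXT)` yields `"page1.txt"`, while script data maps to `None` and therefore produces no output file.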

[0121] Referring to Figure 4, the lossless conversion performed by the image conversion module uses character-by-character analysis to ensure that all information in the document is read, generates in-memory images in 32-bit color (the best color depth currently available), and uses font mapping and a bicubic algorithm to preserve image quality.

[0122] The principle and technology of this embodiment are as follows:

[01...

Embodiment 3

[0141] This embodiment includes not only the document conversion function but also a FLASH conversion function, which converts a PPT file into a FLASH file.

[0142] The key technology applications are as follows:

[0143] (1) PPT courseware parser

[0144] The system obtains the document information of the PPT courseware through the API provided by Microsoft Office PowerPoint. The present invention collects each object in the PPT courseware, the layout and shape of the slides, the animation effects of the text, and some embedded objects, and uses this information to convert them into objects in Flash format.

[0145] Please refer to Table 1. The system parses the PPT layout and document content, accurately calculates the position, size, and geometric shape of each object, and generates a corresponding Flash-format document using the animation effects it has obtained. The following table describes how the system handles each object in a PPT.

[0146] ...
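
The geometry step in [0145], calculating the position and size of each object for the output document, can be sketched in Python as a simple rescaling from slide coordinates to the output stage. This is an illustration only: the type `SlideObject`, the function `scale_to_stage`, and the sizes used are assumptions, and the patent's actual per-object handling is given in its Table 1.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class SlideObject:
    """Position and size of one slide object (hypothetical record)."""
    name: str
    left: float
    top: float
    width: float
    height: float


def scale_to_stage(
    obj: SlideObject,
    slide_size: Tuple[float, float],
    stage_size: Tuple[float, float],
) -> SlideObject:
    """Rescale an object's geometry from slide coordinates to stage coordinates."""
    scale_x = stage_size[0] / slide_size[0]
    scale_y = stage_size[1] / slide_size[1]
    return SlideObject(
        obj.name,
        obj.left * scale_x,
        obj.top * scale_y,
        obj.width * scale_x,
        obj.height * scale_y,
    )


# Example values only: a 720 x 540 slide mapped onto a 1440 x 1080 stage.
title = SlideObject("title", left=72, top=36, width=576, height=90)
scaled = scale_to_stage(title, slide_size=(720, 540), stage_size=(1440, 1080))
```

Scaling each axis independently keeps every object at the same relative position on the stage as on the slide, provided the slide and stage share an aspect ratio.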


Abstract

The invention discloses a document format conversion system and method. The document format conversion system comprises a picture conversion module, a text acquisition module, and a text and image mapping module. The picture conversion module converts each page of a document into picture-format data; the text acquisition module obtains the characters on each page of the document and the state information of those characters in the pictures; and the text and image mapping module generates a mapping table that relates the text information on each page to the pictures, the table containing the state information of all characters in the pictures. The invention avoids the problem of documents being unreadable because a web plug-in is not installed.

Description

Technical field

[0001] The invention belongs to the technical field of computers and relates to a format conversion system, in particular to a document format conversion system; the invention also relates to a conversion method for that document format conversion system.

Background technique

[0002] Nowadays, computer users can read various e-books over the Internet, such as files in WORD, TXT, and PDF formats. The common existing practice is to convert the text into Hypertext Markup Language (HTML). For example, Chinese patent CN200510125040.X provides a system and method for converting a formatted document into a web page. That system and method may include a mapping module, which is programmed to map the document style of the document to the style of the page. The system may also include a conversion module programmed to convert content of the document into HTML based on the mapping of the mapping...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/21, G06F17/22, G06F17/30, G06T11/60
Inventor: 陆昀
Owner: 科大讯飞(上海)科技有限公司