Method and device for reconstructing document file

A document file reconstruction technology in the field of document file processing. It addresses two problems: document files occupying excessive storage space on the network device, and the document display client parsing document files slowly. It achieves the effects of reducing file size and improving parsing speed.

Active Publication Date: 2014-06-18
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD
5 Cites, 4 Cited by

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a method and device for document file reconstruction that address the following problems in the prior art: because the original document file is stored on the network device directly in the format supported by the document display client, it occupies a large amount of storage space on the network device and slows down the parsing of the document file by the document display client of the user device.


Examples

Example 1

[0055] Example 1: The elements of the document file include pictures, whose types include but are not limited to vector graphics and bitmaps. The network device merges the vector graphics on adjacent layers to obtain a merged vector graphic, then merges the merged vector graphic with the bitmap to obtain a merged bitmap, and uses the merged bitmap as one of the elements of the aggregated document file.

[0056] The vector graphics on adjacent layers can be determined as follows:

[0057] - Determine the coverage relationship between document file elements according to the rendering order of the document file elements;

[0058] - Determine the vector graphics on adjacent layers based on the coverage relationship between document file elements. Specifically, the methods for determining the vector graphics on adjacent layers based on this coverage relationship may...
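The two steps above can be sketched as follows. This is only an illustration under stated assumptions, not the patent's method, whose details are truncated in this text: the `Element` type, the bounding-box overlap test, and the rule that a non-vector element rendered between two vector graphics blocks their merging when it overlaps either of them are all hypothetical choices made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    """Hypothetical document element; box is (x_min, y_min, x_max, y_max)."""
    name: str
    kind: str    # "vector", "bitmap", or "text"
    box: tuple


def overlaps(a, b):
    """True when the two bounding boxes share some area."""
    return (a.box[0] < b.box[2] and b.box[0] < a.box[2]
            and a.box[1] < b.box[3] and b.box[1] < a.box[3])


def coverage_pairs(elements):
    """Elements are listed in rendering order: a later element that
    overlaps an earlier one is taken to cover it (assumption)."""
    pairs = []
    for i, lower in enumerate(elements):
        for upper in elements[i + 1:]:
            if overlaps(lower, upper):
                pairs.append((upper.name, lower.name))
    return pairs


def adjacent_vector_groups(elements):
    """Group vector graphics that may be merged. Assumed rule: two
    vector graphics are on adjacent layers unless a non-vector element
    rendered between them overlaps either one."""
    groups = []
    last_index = None  # position of the previous vector graphic
    for i, el in enumerate(elements):
        if el.kind != "vector":
            continue
        blocked = last_index is None or any(
            overlaps(mid, el) or overlaps(mid, elements[last_index])
            for mid in elements[last_index + 1:i]
        )
        if blocked:
            groups.append([el.name])
        else:
            groups[-1].append(el.name)
        last_index = i
    return groups
```

Each group with more than one member would then be merged into a single vector graphic before being combined with the bitmap.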

Example 2

[0066] Example 2: The document file elements include text. If pieces of text have the same style information in their attribute information and their position information places them in the same row or column, and the rectangular region they form does not cover a picture, those pieces of text are merged, and the merged text is used as one of the elements of the aggregated document file.

[0067] The rectangular region formed by text that has the same style information and lies in the same row or column is determined by the minimum abscissa, minimum ordinate, maximum abscissa, and maximum ordinate of that text.

[0068] In this embodiment, merging the text reduces the number of DOM (Document Object Model) nodes, and the speed at which the document display client of the user device presents pictures is further ...
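A minimal sketch of the text-merging rule in this example, assuming each piece of text carries a style label and a bounding box `(x_min, y_min, x_max, y_max)`. The `TextRun` type and the function names are hypothetical, and only the same-row case is handled; the same-column case would be symmetric.

```python
from dataclasses import dataclass


@dataclass
class TextRun:
    """Hypothetical text element with a style label and bounding box."""
    text: str
    style: str
    box: tuple   # (x_min, y_min, x_max, y_max)


def union_box(a, b):
    """Rectangle given by the min/max abscissa and ordinate of both boxes."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def intersects(a, b):
    """True when two rectangles share some area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def same_row(a, b):
    """Assumption: runs are in the same row when their vertical extents match."""
    return a.box[1] == b.box[1] and a.box[3] == b.box[3]


def merge_row_text(runs, pictures):
    """Merge same-style runs in the same row unless the merged
    rectangle would cover any picture box."""
    merged = []
    for run in sorted(runs, key=lambda r: (r.box[1], r.box[0])):
        if merged:
            last = merged[-1]
            box = union_box(last.box, run.box)
            if (last.style == run.style and same_row(last, run)
                    and not any(intersects(box, p) for p in pictures)):
                merged[-1] = TextRun(last.text + run.text, last.style, box)
                continue
        merged.append(run)
    return merged
```

With runs "Hel", "lo", and "World" in one row and a picture lying between "lo" and "World", the first two runs merge into a single element while "World" stays separate, so three text nodes become two.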

Example 3

[0070] Example 3: The font information in the text attribute information is intersected with the font file of the document file, and the font information obtained from this intersection processing is used as one item of the aggregated attribute information.

[0071] The font file of the document file includes the font information of all characters, including characters that do not appear in the document file. The font information after intersection processing includes only the font information of the characters that appear in the document file and excludes that of characters that do not.

[0072] In this embodiment, the font information obtained through intersection processing includes only the font information of the characters in the document file, which further reduces the storage space the document file occupies on the network device.
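The intersection processing can be illustrated with a toy model in which a font is a mapping from characters to glyph data. The `subset_font` function and the dictionary representation are assumptions made for this sketch; a real font file would need a dedicated subsetting tool.

```python
def subset_font(font_glyphs, document_text):
    """Keep only the glyph entries for characters that actually
    appear in the document text (the intersection described above)."""
    used = set(document_text)
    return {ch: glyph for ch, glyph in font_glyphs.items() if ch in used}


# Toy font covering more characters than the document uses.
full_font = {"a": "glyph-a", "b": "glyph-b", "c": "glyph-c", "d": "glyph-d"}
reduced_font = subset_font(full_font, "abba")
```

Here `reduced_font` holds entries only for "a" and "b", which mirrors how the reduced font information takes up less storage than the full font file.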


Abstract

The invention discloses a method and a device for reconstructing a document file. The method comprises the following steps: parsing the document file to obtain the document file elements and the attribute information of the document file elements; aggregating the obtained document file elements and attribute information to obtain aggregated document file elements and attribute information; and reconstructing on the basis of the aggregated document file elements and attribute information to obtain a reconstructed document file. Compared with the prior art, the method and the device aggregate the document file elements and attribute information obtained by parsing and reconstruct the document file from the aggregated result, so that the size of the reconstructed document file is reduced and the document display client of the user equipment parses and presents the reconstructed document file faster.
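The parse, aggregate, and reconstruct steps can be pictured with the toy pipeline below. It is only a sketch under simplifying assumptions: the "document" is a list of `(text, style)` pairs, aggregation merges consecutive elements that share a style, and reconstruction emits HTML `span` elements. All function names are hypothetical.

```python
from html import escape
from itertools import groupby


def parse(document):
    """Stand-in parser: turn (text, style) pairs into element dicts."""
    return [{"text": text, "style": style} for text, style in document]


def aggregate(elements):
    """Merge consecutive elements that share the same style."""
    return [
        {"text": "".join(e["text"] for e in group), "style": style}
        for style, group in groupby(elements, key=lambda e: e["style"])
    ]


def reconstruct(elements):
    """Emit one HTML span per element."""
    return "".join(
        f'<span class="{escape(e["style"])}">{escape(e["text"])}</span>'
        for e in elements
    )


document = [("Doc", "title"), ("ument", "title"), (" body", "plain")]
naive = reconstruct(parse(document))
rebuilt = reconstruct(aggregate(parse(document)))
```

In this toy case `rebuilt` contains two `span` elements instead of three and is shorter than `naive`, which is the kind of size reduction the abstract describes.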

Description

Technical field

[0001] The invention relates to document file processing technology, in particular to a method and device for document file reconstruction.

Background

[0002] When a user reads a document file with the document display client on the user device, in the prior art the network device uses a specific document processing program to convert the format of the original document file so that the converted document file can be presented on the document display client. The network device then provides the format-converted document file to the user device for presentation on the document display client. Taking a browser as the document display client, for example, a document file in PDF (Portable Document Format) format can be converted into a document file in HTML (Hypertext Markup Language) format by using the PDFtoHTML document processing program. The document processing program converts the docum...

Application Information

IPC(8): G06F17/30
CPC: G06F16/34
Inventor: 陈昌兵
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD