Image data processing method of electronic document and device thereof

A technology for image data and electronic documents, applied in the field of electronic document data processing, can solve problems such as large storage overhead, merging errors, and difficult technical implementation, so as to improve the compression rate, reduce the number of I/O operations, and reduce redundant information described effect

Active Publication Date: 2010-11-24
PEKING UNIV +2
View PDF0 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For this application, it is necessary to identify which small volume images can be merged, which is prone to m

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image data processing method of electronic document and device thereof
  • Image data processing method of electronic document and device thereof
  • Image data processing method of electronic document and device thereof

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0026] figure 1 is a flowchart of an image data storage method of an electronic document according to the first embodiment of the present invention.

[0027] refer to figure 1 , in step S1000, images to be processed are collected from electronic documents. The electronic document formats that can be processed by the present invention include formats such as PDF, XPS, CEB, and MARS. In this step, an array can be used to record the path of the collected images stored on the disk or the location in the electronic file.

[0028] In step S1002, an IFC file and an index structure are established. Here, the IFC file refers to a newly created file for storing image data and recording index information, at least including file header information, data area for storing image data, index, index entry and other parts. File header information must be at the beginning of the file. In the file header information, fields such as file type, version information, compression unit, and compr...

no. 2 example

[0038] as in figure 1 As described in step S1006 of , the image data can be meaningfully segmented according to different segmentation strategies. At this time, the index number assigned to the current image is the corresponding number in the data segment to which it belongs. Sometimes, the original image collection sequence itself matches the segment sequence, so the current image data can be assigned index numbers sequentially. However, sometimes, the original image collection order may not be that orderly. At this time, in the case of different segmentation strategies, it is likely that the current image cannot be assigned an index number sequentially, but the current image is assigned the corresponding index number in the data segment to which it belongs according to different data segments. That is, in terms of the order in which images are collected, the assigned index numbers are skipped. In this case, instead of writing the image data segment by segment, a plurality...

no. 3 example

[0046] As mentioned above, collected image data can be meaningfully segmented according to different strategies, so that a certain segment of image data can be used more efficiently by prefetching and caching the image data. For the index structure, it is preferable to use a secondary index structure. The secondary index structure can improve the flexibility of index organization and the speed of index loading, thereby improving the efficiency of operation and wider application range. In the secondary index structure of this embodiment, a primary index and a segment index are set. Correspondingly, the main index entry (the offset position of the main index in the IFC file) is recorded in the IFC file. The main index at least records information such as the number of segment indexes, the offset position of the segment index in the IFC file, and the amount of image data included in the segment. The segment index at least records information such as the offset position of the d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an image data processing method of an electronic document and a device thereof. The image data processing method includes the methods of storing, searching, modifying, canceling and adding image data and comprises the steps of: acquiring image information and the image data from images collected from the electronic document; allocating index number; writing the image data into a corresponding data area in an IFC (Image File Cluster) file; updating corresponding index information according to the image data; and replacing the description using the image in the electronic document by the quote and the index number of the corresponding image information. The methods of searching, modifying, canceling and adding image data are carried out on the image data in the IFC file by an index structure. The invention intensively stores the image data distributed in the electronic document in the IFC file and carries out significative segmentation on the image data according to difference segmentation policies so as to remarkably reduce the storage expense and improve the access efficiency.

Description

technical field [0001] The invention belongs to the field of electronic document data processing, and in particular relates to an electronic document image data processing method and device thereof. The image data processing includes operations such as storing, searching, modifying, adding, and deleting image data of electronic documents. Background technique [0002] There are multiple electronic document formats, each of which uses a different way to describe the images in it. For example, in XPS and MARS documents, XML language and Zip packaging are used to organize the document format, and each image is stored separately. When there are many images in the document, it is necessary to describe the image-related information such as its header information and color space for each image, resulting in the generation of a large amount of redundant content and the expansion of the document volume, which makes the storage overhead of the document and the running time The memory...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 仇睿恒王毅
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products