Compound document image compression using multi-region two layer format

Multi-region two-layer technology, applied to compound document image compression, addresses the following problems: document formatting can be lost or altered, certain document compression methods are not supported by the current PDF or the Adobe Acrobat program, and some documents are formatted with MRC compression or other compression methods that use more than two layers.

Active Publication Date: 2006-12-05
HEWLETT PACKARD DEV CO LP

AI Technical Summary

Problems solved by technology

In comparison, when a document is transmitted in a format according to a word processing program, some of the formatting can be lost or altered.
Certain types of document compression methods, however, are not supported by the current PDF or the A...

Embodiment Construction

[0011]Embodiments consistent with the present invention divide an image into, for example, rectangular regions such that all text within a region has a uniform color under certain criteria. Each region is separated into two layers, a layer of text within the region and a layer of non-text information. Both layers have the same size as the region in this example. The text layer is represented by, for example, a binary two-dimensional matrix having values “0” and “1.” Bit value “1” means that the pixel is a text pixel and bit value “0” means the pixel is not a text pixel; different values can alternatively be used. Moreover, the color of the text can be represented by, for example, three 8-bit numbers R, G, and B for the red, green, and blue color values. The non-text layer is represented by, for example, a two-dimensional matrix that uses three 8-bit numbers (R, G, B) for every pixel to specify its color or for groupings of pixels to specify their collective or common color. Differe...
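The per-region representation described in paragraph [0011] can be illustrated with a minimal Python sketch. The class name `Region`, its field names, and the `render` method are hypothetical choices made for illustration only; the source specifies only the binary text matrix, the single (R, G, B) text color, and the per-pixel (R, G, B) non-text matrix of the same size.

```python
from dataclasses import dataclass
from typing import List, Tuple

RGB = Tuple[int, int, int]  # three 8-bit values: red, green, blue


@dataclass
class Region:
    """One rectangular region whose text shares a single color (hypothetical layout)."""
    x: int                      # assumed: left offset of the region in the page
    y: int                      # assumed: top offset of the region in the page
    text_color: RGB             # one color for all text in the region
    text_mask: List[List[int]]  # binary matrix: 1 = text pixel, 0 = not a text pixel
    non_text: List[List[RGB]]   # per-pixel color of the non-text layer

    def __post_init__(self) -> None:
        # Both layers have the same size as the region in this example.
        if len(self.text_mask) != len(self.non_text):
            raise ValueError("layers differ in height")
        for mask_row, color_row in zip(self.text_mask, self.non_text):
            if len(mask_row) != len(color_row):
                raise ValueError("layers differ in width")

    def render(self) -> List[List[RGB]]:
        """Combine the layers: text color where the mask is 1, non-text color elsewhere."""
        return [
            [self.text_color if bit == 1 else color
             for bit, color in zip(mask_row, color_row)]
            for mask_row, color_row in zip(self.text_mask, self.non_text)
        ]
```

In this sketch a page would be a list of such regions, each carrying its own text color, which is how the two-layer structure can keep color information for text that differs from region to region.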

Abstract

Two layer formatting of documents for compatibility with two layer formatting schemes while maintaining color information and edge sharpness for text. A document is divided into multiple regions based upon bodies of text having the same color. A text layer and a non-text layer are specified for each region. The text layer includes a text color along with binary values for each pixel to specify whether to use the text color or a background color. The non-text layer includes a red-green-blue value for each pixel to specify its color for image and other non-text information, including the background color for the bodies of text. The text layer is compressed using a lossless compression method and the non-text layer is compressed using a lossy compression method.
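The split between lossless and lossy compression can be sketched as follows. The source does not name specific codecs, so everything here is an assumption: `zlib` stands in for the lossless method, and dropping the low bits of each color channel (followed by `zlib`) stands in for the lossy method. All function names and the `keep_bits` parameter are hypothetical.

```python
import zlib
from typing import List, Tuple

RGB = Tuple[int, int, int]


def compress_text_layer(mask: List[List[int]]) -> bytes:
    """Lossless stand-in: pack the binary mask 8 pixels per byte, then deflate."""
    bits = [bit for row in mask for bit in row]
    packed = bytearray()
    for i in range(0, len(bits), 8):
        chunk = bits[i:i + 8]
        chunk += [0] * (8 - len(chunk))  # pad the final byte with zeros
        byte = 0
        for bit in chunk:
            byte = (byte << 1) | bit
        packed.append(byte)
    return zlib.compress(bytes(packed))


def decompress_text_layer(data: bytes, width: int, height: int) -> List[List[int]]:
    """Exact inverse of compress_text_layer for a mask of the given size."""
    bits = []
    for byte in zlib.decompress(data):
        for shift in range(7, -1, -1):
            bits.append((byte >> shift) & 1)
    bits = bits[:width * height]  # discard padding bits
    return [bits[r * width:(r + 1) * width] for r in range(height)]


def compress_non_text_layer(pixels: List[List[RGB]], keep_bits: int = 4) -> bytes:
    """Lossy stand-in: keep only the top keep_bits of each channel, then deflate."""
    drop = 8 - keep_bits
    raw = bytearray()
    for row in pixels:
        for r, g, b in row:
            raw += bytes((r >> drop, g >> drop, b >> drop))
    return zlib.compress(bytes(raw))


def decompress_non_text_layer(data: bytes, width: int, height: int,
                              keep_bits: int = 4) -> List[List[RGB]]:
    """Approximate inverse: the dropped low bits come back as zeros."""
    drop = 8 - keep_bits
    raw = zlib.decompress(data)
    rows = []
    for r in range(height):
        row = []
        for c in range(width):
            i = 3 * (r * width + c)
            row.append((raw[i] << drop, raw[i + 1] << drop, raw[i + 2] << drop))
        rows.append(row)
    return rows
```

The design point the sketch tries to show is that the mask round-trips exactly, which preserves edge sharpness for text, while the non-text layer tolerates a bounded per-channel error in exchange for a smaller payload.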

Description

FIELD OF THE INVENTION

[0001]The present invention relates to an apparatus and method for compressing images and text within a document using a two layer format and a separate compression technique for each format.

BACKGROUND OF THE INVENTION

[0002]A standard for formatting documents includes portable document format (PDF), a page description language used by, for example, the Adobe Acrobat program. Formatting a document as a PDF file means that the document can be transmitted, such as through attachment to an e-mail, without a loss of formatting of the information in the document. Using a PDF viewer, a recipient of the document can open and view the document, and it will have the same format as when transmitted. In comparison, when a document is transmitted in a format according to a word processing program, some of the formatting can be lost or altered. Therefore, conversion of documents to PDF files preserves the original formatting.

[0003]Certain types of document compression method...

Application Information

IPC(8): H04N1/60, G06T9/00, H04N1/40, H04N1/41, H04N1/413, H04N1/46
CPC: H04N1/41
Inventor: FAN, JIAN
Owner: HEWLETT PACKARD DEV CO LP