Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Document analysis system and method

a document analysis and document technology, applied in the field of document analysis, can solve the problems of time-consuming and costly scanning of paper documents to make the content thereon available in a digital environment, requiring the user to wait for an analysis of a whole document, and the processing of various regions of scanned documents may take a long tim

Inactive Publication Date: 2006-05-16
HEWLETT PACKARD DEV CO LP
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0005]The present invention provides a document analysis system and method. In one embodiment, the document analysis system includes a software implementation on a processor circuit, although dedicated logical circuits may be employed as well. The document analysis system includes an interim analyzer configured to perform an interim document analysis to identify a number of interim regions on a document at an initial setting of pixels-per-inch (PPI). The document system also includes a complete analyzer configured to perform a complete analysis on at least one of the interim regions at a second, higher PPI, thereby generating at least one complete region therefrom. The present invention provides significant flexibility to the user with a number of options relative to the analysis of the regions of information of interest in a document, and to limiting the analysis to such preferred regions.
[0007]The present invention has numerous advantages, a few of which are delineated hereafter as merely examples. Specifically, the present invention provides the user with a fast display of the various regions of information on a document and allows the user to control further analysis of these regions and identify the type of information contained therein before processing the regions in an appropriate processing pipeline which may use optical character recognition algorithms, etc. The present invention is also simple in design, user friendly, robust, reliable, and efficient in operation, and easily implemented for mass commercial production.

Problems solved by technology

However, the scanning of paper documents to make the content thereon available in a digital environment may be time consuming and costly.
In particular, one problem is that the processing of various regions of scanned documents may take a long time requiring the user to wait for an analysis of a whole document.
However, current users are often forced to wait while scan converter technology analyzes an entire document to determine the specific data types of the various regions which are ultimately applied to processing pipelines such as optical character recognition pipelines, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document analysis system and method
  • Document analysis system and method
  • Document analysis system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017]Referring to FIG. 1, shown is a block diagram of a document analysis system 100 according to an embodiment of the present invention. The document analysis system 100 includes a computer system 103 which comprises a processor 106, and a volatile / nonvolatile memory 113 (“memory 113”), both of which are coupled to a local interface 116. The computer system 103 further comprises a video interface 119, a number of input interfaces 123, a modem 126, a number of output interfaces 129, and a mobile data storage device 133, all of which are also coupled to the local interface 116. The memory 113 may include, for example, a random access memory (RAM), a read only memory (ROM), a hard drive, and other like devices, or any combination of these devices. Note that the term volatile refers to memory devices that generally lose data stored therein upon loss of power, and non-volatile refers to memory devices that do not lose data upon loss of power.

[0018]The document analysis system 100 also ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Disclosed is a document analysis system and method. The document analysis system includes an interim analyzer configured to perform an interim document analysis to identify a number of interim regions on a digital document at an interim pixels-per-inch (PPI). The document analysis system also includes a complete analyzer configured to perform a complete analysis on at least one of the interim regions at a second PPI, thereby generating at least one complete region therefrom. The document analysis system and method provides significant flexibility to the user with a number of options relative to the analysis of the regions of information of interest in a digital document and to limit analysis to such preferred regions.

Description

CROSS-REFERENCE TO RELATED APPLICATION[0001]This application is a continuation of U.S. patent application Ser. No. 09 / 296,094, filed on Apr. 21, 1999, issued on Jan. 6, 2004, as U.S. Pat. No. 6,674,901.TECHNICAL FIELD[0002]The present invention is generally related to document analysis and, more particularly, is related to a document analysis system and method to flexibly control the analysis of a scanned document or other digital representation of a document.BACKGROUND OF THE INVENTION[0003]More and more documents are generated using word processors and the like and are stored on memory devices such as hard drives, floppy disks, compact disks and other mass storage media. Nonetheless, paper and other similar media will continue to be used far into the future. Consequently, there will continually be a need to scan the substance portrayed on such media so that such information may be manipulated on a computer or other like device.[0004]However, the scanning of paper documents to make...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(United States)
IPC IPC(8): G06K9/34G06V30/40
CPCG06K9/00442G06V30/40
Inventor SIMSKE, STEVEN JRUSSON, VIRGIL K
Owner HEWLETT PACKARD DEV CO LP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products