Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Error correction using fact repositories

a fact repository and error correction technology, applied in the field of computer systems, artificial intelligence and intelligence analysis, can solve problems such as textual and video archives that have errors, and do not contemplate the possibility of correcting other context and/or semantic types of errors

Active Publication Date: 2013-10-15
INT BUSINESS MASCH CORP
View PDF10 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Inevitably, errors occur in computerized text and voice and other such processing.
Errors also originate from other sources, e.g., mistyped data and other mistakes made by people entering the data.
However, they do not contemplate correcting other context and / or semantic type of errors.
Similarly, many repositories of data such as relational and extended markup language (XML) databases, textual and video archives have errors, either in their content or in associated metadata.
Other than for simple cases such as a mismatch between a zip code and a town name, current automated error correcting computer systems or software do not handle correcting such errors.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Error correction using fact repositories
  • Error correction using fact repositories
  • Error correction using fact repositories

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0012]The system and method of the present disclosure in one embodiment corrects errors occurring in computerized text and voice processing and the like using fact information. Many OCR errors such as misinterpreted names and locations, speech recognition errors, handwriting recognition errors and any other conversion from analog to digital medium may be automatically corrected by using facts. Similarly, many repositories of data such as relational and XML databases, textual and video archives have errors, either in their content or in associated metadata. The system and method of the present disclosure in one embodiment apply to the textual part of such databases, i.e., textual fields and associated metadata. The novelty of the solution, in one aspect, lies in applying stores of factual information, for example, large scale stored of factual information, to correct some of such errors.

[0013]In one embodiment, facts (e.g., relations between entities) are extracted from resources suc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The disclosed system and method apply stores of factual information to correct errors in digital text, for example, generated from OCR, speech and / or handwriting recognition devices, and other automatic recognition devices. A text produced by OCR, speech recognition, handwriting recognition, and others may be processed to extract discussed facts. Databases of facts are searched based on information in the text. After comparing facts asserted in the text with the factual data from the databases, suggested corrections of the text are produced.

Description

BACKGROUND[0001]The present disclosure relates generally to computer systems, artificial intelligence and intelligence analysis, and more particularly to correcting errors using fact repositories.[0002]Computers are used to transcribe speech and handwriting. They are also used to convert scanned images of text into text. Examples of such processing include optical character recognition (OCR) that converts paper documents into digital form by scanning, speech recognition that converts voice into text, and handwriting recognitions. Inevitably, errors occur in computerized text and voice and other such processing. Errors also originate from other sources, e.g., mistyped data and other mistakes made by people entering the data.[0003]Existing systems currently correct errors based on a “language model”, i.e. an encoding of statistical information about co-occurrence of words or word patterns. For instance, existing solutions correct some spelling errors or grammatical errors. However, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(United States)
IPC IPC(8): G06F17/21
CPCG06F17/273G06F40/232
Inventor FERRUCCI, DAVID A.GONDEK, DAVID C.ZADROZNY, WLODEK W.
Owner INT BUSINESS MASCH CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products