Difference item discrimination method and device

A method and technique for discriminating difference items, applied in the computer field. It addresses OCR recognition errors and the resulting low accuracy of document comparison results, thereby improving comparison accuracy.

Pending Publication Date: 2021-03-19
IFLYTEK CO LTD

AI Technical Summary

Problems solved by technology

However, due to noise interference (such as watermarks, signatures, etc.) in the OCR recognition process, OCR recognition errors will occur. Therefore, some of t...


Embodiment Construction

[0044] Terms used in the embodiments of the present application are only for the purpose of describing specific embodiments, and are not intended to limit the present application. The terms "first" and "second" in the description and claims in the embodiments of the present application are used to distinguish different objects, rather than to describe a specific order.

[0045] For ease of understanding, the following first introduces related terms and the like that may be involved in the embodiments of the present application.

[0046] (1) OCR

[0047] OCR refers to the process in which an electronic device (such as a scanner or a digital camera) examines characters printed on paper, determines their shapes by detecting light and dark patterns, and then uses character recognition methods to translate the shapes into computer text. Put simply, it is the process of analyzing and processing the image files obtained after scanning text materials to obtain text and layout informat...



Abstract

The invention discloses a difference item discrimination method and device. The method comprises: obtaining a target difference item between a recognition result of a first simple sentence and a recognition result of a second simple sentence, wherein the target difference item comprises a first difference text and a second difference text, the first simple sentence comprises a common item and the first difference text, and the second simple sentence comprises the common item and the second difference text; determining, based on a language prediction model and the common item, a first probability corresponding to the first difference text and a second probability corresponding to the second difference text; and judging, according to the first probability and the second probability, whether the target difference item is a real difference item. By implementing the method and the device, real difference items can be effectively identified and non-real difference items caused by OCR recognition errors can be filtered out, thereby improving the accuracy of simple sentence comparison.
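
The judgment described in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the application: the toy character-bigram model `CharBigramModel` stands in for the language prediction model, scores are length-normalised log-probabilities rather than raw probabilities, and the decision rule in `is_real_difference` (treat the item as real only when both texts are about equally plausible after the common item) with its threshold `max_gap` is hypothetical.

```python
import math
from collections import Counter


class CharBigramModel:
    """Toy character-bigram model with add-one smoothing (illustrative stand-in)."""

    def __init__(self, corpus):
        self.bigrams = Counter()
        self.unigrams = Counter()
        self.vocab = set()
        for sentence in corpus:
            chars = ["<s>"] + list(sentence)
            self.vocab.update(chars)
            for prev, cur in zip(chars, chars[1:]):
                self.bigrams[(prev, cur)] += 1
                self.unigrams[prev] += 1

    def avg_log_prob(self, common_item, diff_text):
        """Average per-character log-probability of diff_text following common_item."""
        prev = common_item[-1] if common_item else "<s>"
        size = len(self.vocab) + 1  # +1 reserves mass for unseen characters
        total = 0.0
        for ch in diff_text:
            total += math.log((self.bigrams[(prev, ch)] + 1) / (self.unigrams[prev] + size))
            prev = ch
        return total / max(len(diff_text), 1)


def is_real_difference(model, common_item, first_diff, second_diff, max_gap=0.5):
    """Hypothetical rule: a real difference has two similarly plausible texts.

    A large gap suggests one text is an OCR misrecognition of the other.
    """
    p1 = model.avg_log_prob(common_item, first_diff)
    p2 = model.avg_log_prob(common_item, second_diff)
    return abs(p1 - p2) <= max_gap


# Made-up training sentences, used only to give the toy model some statistics.
corpus = [
    "the total amount is 100 yuan",
    "the total amount is 200 yuan",
    "the deposit is 100 yuan",
    "the deposit is 200 yuan",
]
model = CharBigramModel(corpus)
common = "the total amount is "

print(is_real_difference(model, common, "100", "200"))  # both amounts are plausible
print(is_real_difference(model, common, "100", "l00"))  # "l00" looks like an OCR slip
```

In practice the threshold and the model would have to be chosen on real data; the sketch only shows how the two probabilities conditioned on the common item can drive the decision.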

Description

Technical field

[0001] The present application relates to the field of computers, in particular to a method and device for discriminating difference items.

Background technique

[0002] With the development of business, a large number of documents are also generated, such as contract documents, bidding documents, etc. Among them, modification of the same document may generate multiple documents. For example, for the same contract document, there is one original document and possibly multiple modified versions of the document. In order to determine what modifications have been made to a document of a certain modified version relative to another document, it is necessary to compare the contents of the two associated documents to determine differences.

[0003] Usually, optical character recognition (Optical Character Recognition, OCR) technology is used to scan and recognize the two documents to be compared, and then based on the recognition results output by the OCR, the con...
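
As a rough illustration of how comparing two recognition results yields candidate difference items, the following Python sketch uses the standard library's `difflib.SequenceMatcher`. The character-level comparison, the helper name `find_difference_items`, and the sample sentences are assumptions for illustration; the application does not state how the comparison itself is performed.

```python
from difflib import SequenceMatcher


def find_difference_items(first_result, second_result):
    """Return (preceding_text, first_diff, second_diff) for each differing span.

    preceding_text is the part of the first recognition result that comes
    before the differing span; it plays the role of the shared context.
    """
    matcher = SequenceMatcher(None, first_result, second_result, autojunk=False)
    items = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":  # "replace", "delete" or "insert"
            items.append((first_result[:i1], first_result[i1:i2], second_result[j1:j2]))
    return items


# Hypothetical OCR outputs for the same sentence in two document versions.
items = find_difference_items(
    "the total amount is 100 yuan",
    "the total amount is 200 yuan",
)
print(items)
```

Each tuple produced this way is only a candidate: whether it reflects a real modification or an OCR recognition error still has to be judged separately.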


Application Information

IPC(8): G06K9/00, G06K9/62
CPC: G06V30/418, G06V30/10, G06F18/22, G06F18/253, G06F18/24
Inventor: 王亚利, 宋时德, 唐刘建, 庄纪军
Owner: IFLYTEK CO LTD