Text information processing method, system and device and computer readable storage medium

An information processing method and text technology, applied in computer components, calculation, electrical digital data processing, etc., can solve problems such as high error rate, overlapping text coverage, difficult recognition, etc., and achieve the effect of improving accuracy

Active Publication Date: 2020-07-28
北京爱咔咔信息技术有限公司
View PDF10 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, in the process of text recognition through OCR technology, the following recognition errors often occur: for some long text information on paper documents, the head and/or tail of the text are cut due to text positioning deviation; Structural fonts, it is easy to mistakenly identify Chinese characters with left-right structure or left-right structure in the text information as two or more characters, such as recognizing "ka" as "mouth card", etc.; because paper documents are not clear, printing Difficulties in

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text information processing method, system and device and computer readable storage medium
  • Text information processing method, system and device and computer readable storage medium
  • Text information processing method, system and device and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to make the objectives, technical solutions and advantages of the present invention clearer, the following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are the present invention. Invented some embodiments, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0025] The terms "first", "second", etc. involved in the present invention are only used for descriptive purposes, and cannot be understood as indicating or implying relative importance or implicitly indicating the number of indicated technical features. In the description of the following embodiments, "multiple" means two or more, unless otherwise c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text information processing method, system and device and a computer readable storage medium. According to the method, the error correction model obtained by training the error correction training set corresponding to the type of the to-be-processed text is adopted in advance, error correction processing is conducted on the to-be-processed text, at least one correction text of the to-be-processed text is obtained, and correction of glyph errors and the like in the to-be-processed text is achieved; a named entity recognition model is obtained by training a structured feature training set corresponding to the type of a to-be-processed text in advance; extracting structural features of the corrected text; matching the structural features of the corrected text with the structural features of each part of standard text information in a trusted data set; and determining standard text information corresponding to the corrected text, thereby further correcting named entity errors existing in the corrected text through the structured features, and improving the accuracy of text information recognition.

Description

Technical field [0001] The present invention relates to the field of image processing technology, in particular to a text information processing method, system, equipment and computer-readable storage medium. Background technique [0002] In daily work or life, paper documents such as various bills and certificates are used, such as invoices, business licenses, etc. In order to realize the identification of paper documents, computer technology is used to automatically identify text information printed on paper Become a trend. Especially for key text information such as company names, it has specific structural features and requires high recognition accuracy. In many financial situations, company names and similar text information are not allowed to have any errors. [0003] At present, the recognition of text information printed on paper mainly uses optical character recognition (Optical Character Recognition, hereinafter referred to as OCR) technology, which uses optical technolo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/295G06K9/62G06N3/04
CPCG06V30/10G06N3/044G06N3/045G06F18/214
Inventor 邬国锐李杨
Owner 北京爱咔咔信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products