Bill text recognition method based on novel network

A text recognition and network technology, applied in the field of computer vision, can solve the problem of low recognition accuracy, and achieve the effect of strong generalization, excellent robustness and high accuracy

Pending Publication Date: 2020-11-03
晶璞(上海)人工智能科技有限公司
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] OCR recognition technology has become the main means of converting paper documents into electronic documents. This technology can greatly facilitate people's information entry work. After normalizing the text line image data obtained by the detection module to a fixed height, it is necessary to use general text recognition. Technology, to identify the text content corresponding to the image. Currently, the text lin

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bill text recognition method based on novel network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0063]For example, on a general computer, the scanned medical outpatient (emergency) receipts in various provinces and cities across the country are processed. After the image data of the image is obtained in step 110, the training and verification data sets are established in step 120, the neural network model structure is constructed in step 130, the model training is performed in step 140, and the model deployment and prediction reasoning are performed in step 150. According to the recognition result, the structured output is output to excel or json file.

Embodiment 2

[0065] For example, on a general computer, the ID card obtained by scanning is processed, and the method described in the present invention is used. After the image data of the image is obtained in step 110, the training and verification data sets are established in step 120, the neural network model structure is constructed in step 130, the model training is performed in step 140, and the model deployment and prediction reasoning are performed in step 150. According to the recognition result, the structured output is output to excel or json file.

Embodiment 3

[0067] For example, on a general computer, the scanned bank documents are processed using the method described in the present invention. After the image data of the image is obtained in step 110, the training and verification data sets are established in step 120, the neural network model structure is constructed in step 130, the model training is performed in step 140, and the model deployment and prediction reasoning are performed in step 150. According to the recognition result, the structured output is output to excel or json file.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a text recognition technology in the field of computer vision, in particular to a bill text recognition method based on a novel network, which comprises the following specificsteps: step 110, acquiring image data of a bill; step 120, creating a text recognition data set; using the text line image data obtained by the text detection module to carry out input and quality inspection on the text line image data, writting a text label into txt or a text line image name, and dividing the data into a training set, a test set and a verification set according to a certain proportion; compared with a common text recognition method, the method is higher in generalization and better in robustness by means of a feature extraction and image correction algorithm in deep learning, can effectively cope with severe conditions such as bill distortion, perspective, printing body ink mark breakage and dirt; by adding BN layers to different levels, the recognition accuracy is higher.

Description

technical field [0001] The invention relates to text recognition technology in the field of computer vision, and specifically designs a bill text recognition method, and the application scene is the text recognition of medical bills. Background technique [0002] OCR recognition technology has become the main means of converting paper documents into electronic documents. This technology can greatly facilitate people's information entry work. After normalizing the text line image data obtained by the detection module to a fixed height, it is necessary to use general text recognition Technology, to identify the text content corresponding to the image. Currently, the text line image data is affected by illumination, geometric transformation, background, font, style, resolution, etc., and the shape, texture and size change a lot, which poses a challenge to the recognition work, so it cannot be effective. It can effectively deal with harsh conditions such as distortion of bills, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/62G06N3/04
CPCG06V30/40G06V30/10G06N3/045G06F18/214
Inventor 陈俊霞严京旗周审章卞志强张成栋
Owner 晶璞(上海)人工智能科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products