Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A character recognition and processing method for image acquisition by a digital screen

A technology for text recognition and image acquisition, which is applied in character recognition, character and pattern recognition, neural learning methods, etc. It can solve the problem of inability to deal with characters that are too close or noisy, the edges of text strokes are not smooth, and the generalization of text segmentation is poor, etc. problem, to achieve the effect of low-cost deployment, less resource occupation, and fast computing speed

Active Publication Date: 2021-09-28
北京匠数科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] First, compared with the printed text, the captured image of the digital screen has uneven, jagged, and often distorted edges. The text recognition algorithm for electronic documents is prone to misrecognition when dealing with the above situations;
[0007] Second, the general text detection model and text recognition model are complex and the training cost is high;
[0008] Third, the text and image recognition of the digital screen generally needs to be deployed on-site embedded devices, and real-time recognition is required
However, this method has poor generalization for text segmentation and cannot cope with the situation where the text spacing is too close or there is noise.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A character recognition and processing method for image acquisition by a digital screen
  • A character recognition and processing method for image acquisition by a digital screen
  • A character recognition and processing method for image acquisition by a digital screen

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0037] see figure 1 A method for character recognition and processing of images collected by a digital screen is provided, comprising the following steps:

[0038] S1: Using a convolutional neural network as the basic network of the text detection model and the text recognition model, cutting the number of convolution kernels and the number of convolution layers of the text detection model and the text recognition model;

[0039] S2: Complete the full convolution of the text detection model and the text recognition model through the 1x1 convolutional layer, and perform multi-scale feature extraction on the input image collected by the text screen. The feature map output by the text detection model is divided into two parts: the text area and the background area A kind of value, extracting the text area from the image collection image of the word walking screen through the mask;

[0040] S3: the text recognition model adopts an image classification model based on the alexnet str...

Embodiment 2

[0063] The present invention provides a computer-readable storage medium. The computer-readable storage medium stores program codes for character recognition processing of images captured by a digital screen. Instructions of the method for character recognition and processing used for image acquisition by the digital screen in the implementation manner.

[0064] The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server, a data center, etc. integrated with one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, a hard disk, or a magnetic tape), an optical medium (for example, DVD), or a semiconductor medium (for example, a solid state disk (SolidStateDisk, SSD)) and the like.

Embodiment 3

[0066] The present invention provides an electronic device, the electronic device includes a processor, the processor is coupled with a storage medium, and when the processor executes instructions in the storage medium, the electronic device executes Embodiment 1 or its A character recognition processing method for images collected by a digital screen in any possible implementation manner.

[0067] Specifically, the processor can be implemented by hardware or by software. When implemented by hardware, the processor can be a logic circuit, an integrated circuit, etc.; when implemented by software, the processor can be a general-purpose processor. The processor is realized by reading the software codes stored in the memory. The memory may be integrated in the processor, or it may be located outside the processor and exist independently.

[0068] In the above embodiments, all or part of them may be implemented by software, hardware, firmware or any combination thereof. When impl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a character recognition processing method for images collected by a digital screen. The characteristic map output by a character detection model consists of two values, the text area and the background area, respectively, and the text area is extracted from the image collected by the digital screen through a mask. ; The input image of the text recognition model is a preset size, and the output of the text recognition model is a character category, and the character area text area is obtained by querying the mapping relationship between the category value and the computer character; aggregation is performed according to the row coordinates of the character area text area, and According to the sequence of each character row coordinates from left to right, the characters are composed into a string; the training phase of the text detection model introduces text edge and text gap images as training data, and in the inference phase, when the center of the sampling window falls on the text edge or between the two When there is a space in the middle of a text, the final feature value is defined as the background. The present invention can simulate the text features of the text screen and generate characteristic training data, so that the model can realize targeted training on the text of the text screen, and the training effect is good.

Description

technical field [0001] The invention relates to the technical field of word processing, in particular to a method for character recognition and processing for image acquisition by a digital screen. Background technique [0002] At present, text recognition (OCR) in images is an important scene in the field of deep learning. Different from the traditional method of using image processing technology to extract text area features and using classifiers to determine characters, OCR technology based on deep learning uses deep neural networks to extract image features, which can achieve much higher recognition accuracy than traditional effects. [0003] OCR processing based on deep learning is generally divided into two models, text area detection and text character recognition. The text area detection model scans the input text image and marks the text area; the text character recognition model extracts and classifies each character in the text area to obtain the character value,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/32G06K9/34G06N3/04G06N3/08
CPCG06N3/04G06N3/08G06V20/62G06V30/153G06V30/10
Inventor 侯磊张乐平张博支蕴倩李海峰
Owner 北京匠数科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products