Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text recognition method, device, device and storage medium

A text recognition and text area technology, applied in character and pattern recognition, instruments, computing, etc., can solve problems such as unrecognizable text images, difficult to simulate data structures, chaotic text character order, etc., to avoid text disorder and labeling work, easy to accurately identify the effect

Active Publication Date: 2021-10-22
TENCENT TECH (SHENZHEN) CO LTD
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] At present, OCR (Optical Character Recognition, optical character recognition) technology is widely used. When OCR technology detects text, the detected text area is generally a rectangular frame, a rotated rectangular frame or a four-point rectangular frame, such as figure 1 As shown, this detection method, for irregular text (such as curved text), the detected text area will include a large number of background areas, which will cause great interference to the text recognition
Moreover, the existing text recognition methods, such as the convolutional recurrent neural network CRNN (Convolutional Recurrent Neural Network) recognition method, are only good for rectangular text image recognition, but cannot be recognized for text images that include a large number of backgrounds.
[0003] In addition, the existing recognition methods for irregular text include attention model (Attention model) and polar coordinate correction method, wherein the attention model can recognize the text of 2D structural information (such as formula), and can be applied to the irregular text Recognition, but requires a large amount of training data, and it is difficult to construct through simulated data. At the same time, it will introduce the problem of chaotic text characters
The polar coordinate correction method is to restore the arc-shaped text to a straight-line text and then perform text recognition. This method is not robust to lighting, distortion and complex scenes.
[0004] Nowadays, in the cloud operations of many online businesses, the text of certain types of objects in the image will be recognized and authenticated, such as the text of the seal in the images of official documents, bills, and certificates, etc., and the seal will generally include Irregular text (such as curved text, T-shaped text, etc.), from the above analysis, it can be seen that none of the existing text recognition technologies can effectively identify the irregular text in the seal

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text recognition method, device, device and storage medium
  • Text recognition method, device, device and storage medium
  • Text recognition method, device, device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] Various exemplary embodiments, features and aspects of the present disclosure will be described in detail below with reference to the accompanying drawings. The same reference numbers in the figures denote elements that have the same or similar functions. While various aspects of the embodiments are shown in the drawings, the drawings are not necessarily drawn to scale unless otherwise indicated.

[0046] The word "exemplary" is used exclusively herein to mean "serving as an example, embodiment, or illustration." Any embodiment described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other embodiments.

[0047] In addition, in order to better illustrate the present disclosure, numerous specific details are given in the following detailed description. It will be understood by those skilled in the art that the present disclosure may be practiced without certain specific details. In some instances, methods, means, components a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present disclosure relates to a text recognition method, device, equipment and storage medium. The method includes inputting the obtained preset object image into a progressive expansion network for segmentation to obtain target segmented images including irregular text regions; using a thin-plate spline interpolation algorithm to correct the target object image, and using a text recognition model to identify the corrected Horizontal object text image. The pixel-level segmentation of preset object images can be realized to effectively detect text areas of various shapes, and by correcting irregular text areas to obtain horizontal text areas for recognition, it is possible to avoid text errors caused by direct recognition of irregular text Out-of-order problems, as well as a large amount of labeling work for irregular text, text recognition models that can be trained with horizontal text images have stronger generalization capabilities. In addition, the correction is specifically implemented through the TPS algorithm, which can be applied to more complex application scenarios and has better robustness.

Description

technical field [0001] The present disclosure relates to the technical field of artificial intelligence, and in particular, to a text recognition method, apparatus, device and storage medium. Background technique [0002] At present, OCR (Optical Character Recognition, Optical Character Recognition) technology is widely used. When OCR technology detects text, the detected text area is generally a rectangular frame, a rotated rectangular frame or a four-point rectangular frame, such as figure 1 As shown, in this detection method, for irregular text (such as curved text), the detected text area will include a large number of background areas, which will cause great interference to the recognition of the text. Moreover, the existing text recognition methods, such as the Convolutional Recurrent Neural Network (CRNN) recognition method, are only effective in recognizing rectangular text images, but cannot recognize text images including a large number of backgrounds. [0003] In...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/34G06K9/20G06K9/62
CPCG06V10/22G06V30/153G06V10/267G06F18/214
Inventor 包志敏
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products