Character recognition method and device, equipment and medium

A character recognition technology, applied in the computer field, that addresses the heavy manual labeling effort and training cost of supervised OCR training, reducing labeling costs and improving training efficiency.

Pending Publication Date: 2022-05-06
BEIJING YOUZHUJU NETWORK TECH CO LTD

AI Technical Summary

Problems solved by technology

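To improve the recognition accuracy of an OCR recognition model, a large amount of sample data must be collected, and labeling it manually consumes a lot of manpower, which increases the training cost.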

Example Embodiment

[0027] To help those skilled in the art better understand the solutions of the present application, the technical solutions in the embodiments of the present application are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only a part of the embodiments of the present application, not all of them. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative effort shall fall within the protection scope of the present application.

[0028] OCR refers to the technology of analyzing and recognizing image files containing text data to obtain text. Usually, the OCR recognition model is generated by supervised training. During the training process, it is necessary to collect sample data that has been manually annotated, and then use the sampl...

Abstract

The embodiment of the invention discloses a character recognition method that performs recognition with a pre-trained character recognition network model. The model is generated by training on a first sample image and a plurality of sub-sample images corresponding to the first sample image. Each sub-sample image has the same height as the first sample image, all sub-sample images have the same width, and that width is smaller than the width of the first sample image. Because the first sample image does not need to be manually annotated and the model is trained by aligning local features (the sub-sample images) with overall features (the first sample image), the annotation cost is reduced and the training efficiency is improved. In actual use, a text image is input into the character recognition network model, which completely extracts the features of the text information in the image, performs recognition according to those features, and outputs a result.
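The geometric constraints on the sub-sample images can be illustrated with a minimal sketch. The abstract only states that each sub-sample image keeps the full height, that all sub-sample images share one width, and that this width is smaller than the width of the first sample image. The function name `split_into_sub_samples`, the `stride` parameter, the left-to-right sliding window, and the list-of-rows image representation are all assumptions made for illustration, not details taken from the application.

```python
from typing import List

# Rows of pixel values; a stand-in for a real image array (assumption).
Image = List[List[int]]


def split_into_sub_samples(image: Image, sub_width: int, stride: int) -> List[Image]:
    """Crop full-height windows of equal width, moving left to right.

    The sliding-window scheme and the stride are hypothetical; the source
    only fixes the height and width constraints on the sub-sample images.
    """
    if not image or not image[0]:
        raise ValueError("image must be non-empty")
    width = len(image[0])
    if any(len(row) != width for row in image):
        raise ValueError("all rows must have the same width")
    if not 0 < sub_width < width:
        raise ValueError("sub_width must be positive and smaller than the image width")
    if stride <= 0:
        raise ValueError("stride must be positive")

    sub_samples = []
    for left in range(0, width - sub_width + 1, stride):
        # Every row is kept, so the crop has the same height as the input.
        sub_samples.append([row[left:left + sub_width] for row in image])
    return sub_samples


# A toy 3x8 "first sample image" whose pixel values encode their position.
first_sample = [[col + 10 * row for col in range(8)] for row in range(3)]
sub_samples = split_into_sub_samples(first_sample, sub_width=4, stride=2)
```

With these toy values the windows overlap, because the stride is smaller than the sub-sample width. Whether the sub-sample images in the application overlap is not stated in the abstract.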

Description

Technical field

[0001] The present application relates to the field of computer technology, in particular to a character recognition method, device, equipment and medium.

Background

[0002] Optical Character Recognition (OCR) refers to the technology of analyzing and recognizing image files containing text data to obtain text. It is an important area of research and application in the field of automatic recognition technology.

[0003] Usually, the OCR recognition model is generated through supervised training. During the training process, it is necessary to collect sample data that has been manually labeled, and then use the sample data for training. To improve the recognition accuracy of the OCR recognition model, a large amount of sample data must be collected, and labeling it manually consumes a lot of manpower, which increases the training cost.

Contents of the invention

[0004] In view of this, the embodiments of the present applic...

Application Information

IPC(8): G06V20/62, G06V30/148, G06V10/774, G06K9/62
CPC: G06F18/214
Inventor: 毛晓飞, 黄灿
Owner: BEIJING YOUZHUJU NETWORK TECH CO LTD