Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Language identification method of scene text image in combination with global and local information

A text image and language recognition technology, applied in the field of computer vision, can solve the problems of the disadvantage of the number of distinguishing features, image block interference network prediction of the whole image, redundant calculations, etc.

Active Publication Date: 2019-10-15
HUAZHONG UNIV OF SCI & TECH
View PDF11 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First, they considered the contribution of all high-dimensional features to the recognition results, but if there are many non-discriminative characters, the resulting redundant unimportant features will make the truly important features less significant, or Discriminative features are underutilized due to numerical disadvantages
Secondly, the method of directly cutting out a series of image blocks from the original image will cause some image blocks to be unrecognizable at all, which will interfere with the network's prediction of the entire image, and will generate redundant and unnecessary calculations.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Language identification method of scene text image in combination with global and local information
  • Language identification method of scene text image in combination with global and local information
  • Language identification method of scene text image in combination with global and local information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0042] Below at first explain and illustrate with regard to the technical terms of the present invention:

[0043] Global average pooling: For a certain feature map, the average value is taken on the two-dimensional matrix of each channel as the average information of the channel. This is equivalent to representing a corresponding two-dimensional channel with an average. The global maximum poo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a language identification method of a scene text image in combination with global and local information. Basic features of a character image are extracted, and then global andlocal feature representations are extracted respectively; the global extraction branch uses global maximum pooling to express the whole graph as a vector, and category score prediction is carried out;probability prediction is performed on the local blocks of the image by the local aggregation branches respectively, and then the series of probability distributions are combined to obtain a categoryprediction score of a local level; and finally, global and local prediction scores are dynamically fused according to the branch prediction conditions to obtain a final identification result. According to the method, overall features and local differentiated features of the character images are noticed at the same time, and end-to-end training can be achieved in one step. Compared with an existing technology utilizing local features, the method has the advantages that the local differentiated features can be accurately extracted, excellent effects are achieved in the aspects of accuracy, operation efficiency and universality, and high practical application value is achieved.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and more specifically relates to a language recognition method of a scene text image combining global and local information. Background technique [0002] The task of image language recognition is to identify the language of the text contained in a given text image. With the development of globalization, the task of language recognition plays an increasingly important role in today's multilingual systems. It can be seen from the results of the ICDAR MLT competition that the same model has great differences in the detection and recognition of characters in different languages. In many cases, we need to choose which language model to use for further text detection or recognition, and language recognition can help us provide automated choices. In the end-to-end multilingual text detection and recognition system that may appear in the future, the text language can be used as an important in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/32G06K9/62G06F17/27G06N3/04G06N3/08
CPCG06N3/084G06V20/63G06F40/263G06N3/045G06F18/241G06F18/214
Inventor 白翔程昌旭黄秋慧刘文予
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products