Non-directional white-box attack resisting method for scene character recognition

A text recognition, non-directional technology, applied in the field of computer vision, can solve the problems of low attack efficiency, can not be directly recognized by the application scene text, large disturbance of the adversarial sample, etc., to achieve the effect of small noise disturbance, fast speed, and reduced number of iterations

Inactive Publication Date: 2021-05-25
HUAZHONG UNIV OF SCI & TECH
View PDF2 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Different from general image pictures, the content of scene text pictures is a sequence of characters, so although there have been many adversarial studies on general images in recent years, they cannot be directly applied to scene text recognition.
In addition, the current mainstream adversarial attack methods can be divided into two categories. The first is the gradient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Non-directional white-box attack resisting method for scene character recognition
  • Non-directional white-box attack resisting method for scene character recognition
  • Non-directional white-box attack resisting method for scene character recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] In order to make the purpose, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the accompanying drawings and examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0041] Below at first explain and illustrate with regard to the technical terms of the present invention:

[0042] ResNet: A neural network that can be used for classification. The network is mainly composed of convolutional layers, pooling layers, and shortcut connection layers. The convolutional layer is used to extract image features; the function of the pooling layer is to reduce the dimen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a non-directional white-box attack resisting method for scene character recognition, which is characterized in that noise which is difficult to perceive by human eyes is added into an input image, so that a character recognition algorithm obtains a recognition result which is totally different from the original recognition result, and the effect of attacking a scene character recognition network model is achieved. Different from common object recognition, a character recognition result is a character sequence, so that an existing adversarial attack method cannot be directly applied to scene character recognition. Therefore, the method comprises the following steps: firstly, an objective function in a single object classification adversarial attack algorithm is modified into a sequence form; in order to accelerate generation of an adversarial sample, a step function is introduced in, and once a character is different from the recognition result of the original image in the recognition result of the adversarial sample, the penalty of the target function is zero; besides, in consideration of different attack difficulty levels of different samples, the more easily recognized correctly the samples are, the more difficultly attacked the samples are, recognition scores are introduced into the target function, so that adversarial samples with smaller disturbance are obtained.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and more specifically relates to a non-directional white-box confrontation attack method for scene text recognition. Background technique [0002] In the field of computer vision, scene text recognition is a challenging task because of the variety of text appearances and the complexity of natural scenes. Thanks to the development of deep learning and a large amount of training data, scene text recognition has achieved impressive development in recent years. However, recent adversarial studies have shown that deep learning models are vulnerable to small perturbations in the input data, leading to large changes in the model output. As one of the most practical tasks in computer vision, scene text recognition still faces huge security risks. But so far there has been no work on adversarial attacks on scene text recognizers with attention mechanisms. Therefore, research on this aspect is i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/04G06N3/08G06K9/62G06F17/18
CPCG06N3/084G06F17/18G06N3/044G06N3/045G06F18/214
Inventor 白翔杨明锟
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products