Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Pixel-driven mobile phone operation interface text detection method

A technology for mobile phone operation and interface text, applied in the field of image processing, can solve problems such as inapplicable application scenarios, and achieve the effects of improving generalization, increasing receptive field, and improving convergence speed

Active Publication Date: 2020-04-10
易诚高科(大连)科技有限公司
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The methods proposed by researchers recently mainly focus on the distortion and projective transformation of text in natural scenes, but the text in the mobile phone interface almost does not have this problem, and it is not suitable for the application scenario of this scheme.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pixel-driven mobile phone operation interface text detection method
  • Pixel-driven mobile phone operation interface text detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] This scheme proposes a pixel-driven mobile phone operation interface text detection method. This method improves the existing EAST text detection model, and uses randomly generated mobile phone operation interface images to assist real images for training. Finally, through post-processing The method optimizes the detection results according to the application scenario. The image generation part of the mobile phone operation interface is shown in step 1, the model improvement part is shown in step 2, and the post-processing scheme is shown in step 3.

[0038] 1. The random text generation method based on the mobile phone operation interface template uses a small amount of manually marked mobile phone operation interface image data to obtain a corpus, and then generates a new image on the preset template. The specific steps are as follows:

[0039] 1.1 Text library establishment:

[0040] 1.1.1 Obtain the mobile phone operation interface image by means of screenshot;

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a pixel-driven mobile phone operation interface text detection method. The method comprises the following steps: 1) generating a mobile phone operation interface image; 2) improving a text detection model; and 3) correcting a detection box error based on a projection method. According to the pixel-driven mobile phone operation interface text detection method, the problem oflack of text marking data of an existing mobile phone operation interface is solved; the position and font size of the generated data are random within a certain range, and the generalization of themodel can be improved by training; an up-sampling layer is redesigned for the output of ResNet50, so that the deep feature proportion is improved; aiming at the condition that the mobile phone operation interface scene hardly has angle deviation, an angle regression branch of a regression part of the detection frame is removed and a loss function is correspondingly modified to improve the networkconvergence speed; the detection frame error correction method based on the projection method can assist in correcting the detection frame deviation under a pure color background, and the problems ofincomplete character detection, large blank area or the like are reduced as much as possible.

Description

technical field [0001] The invention relates to the technical field of image processing. Background technique [0002] The mobile phone interface contains three elements: text, icons, and pictures. In order for the computer to understand the semantics of all elements in the interface from bottom to top, it is necessary to divide the interface into several small pieces with independent semantics. Because the texture of the text area is complex and the aspect ratio is uncertain, it is usually impossible to completely identify the text area using a general-purpose target detection method, so a specific text detection method is required for text segmentation. [0003] At present, text detection methods are divided into methods based on the underlying features of images and methods based on deep learning. Methods based on the characteristics of the image itself (such as MSER, ECCV2008 and SWT, CVPR2010) detect text by analyzing the distribution of spot areas and strokes in the i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/20G06F40/289G06N3/04G06N3/08
CPCG06N3/08G06V10/22G06N3/045Y02D30/70
Inventor 廖志梁王道宁陶亮马壮
Owner 易诚高科(大连)科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products