Scene character recognition method based on visual language modeling network
A visual language and text recognition technology, applied in character recognition, character and pattern recognition, reasoning methods, etc., can solve the problem that it is difficult to fully consider and effectively integrate text recognition, large additional computing overhead, scene text recognition speed and accuracy need to be improved and other issues to achieve the effect of improving recognition ability and enhancing visual features
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0020] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
[0021] Embodiments of the present invention provide a scene text recognition method based on a visual language modeling network, such as figure 1 As shown, it mainly includes:
[0022] Construct a visual model including a backbone network, a position-aware mask generation module and a visual semantic reasoning module, and use the position-aware mask generation module to guide the visual semantic reasoning module to deduce the occluded character inf...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


