Text recognition method and device, computer device and computer readable storage medium
By training a feature extraction model using unlabeled text image samples and combining DenseNet and multi-head attention mechanisms, the problem of poor recognition performance and high training difficulty of OCR models in font deformation scenarios is solved, achieving more efficient text recognition capabilities.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- TENCENT TECHNOLOGY (SHENZHEN) CO LTD
- Filing Date
- 2021-08-17
- Publication Date
- 2026-06-16
AI Technical Summary
Existing OCR models have poor recognition performance when faced with font deformation in different scenarios, and obtaining training samples requires a lot of manpower, resulting in high training difficulty.
The training effect of the feature extraction model is enhanced by training it with unlabeled text image samples. The DenseNet neural network and multi-head attention mechanism are used for image feature extraction and attention feature extraction. The training sample index is calculated and predicted by combining image attribute information.
It improves the recognition ability of OCR models in different scenarios, reduces the training difficulty, reduces the dependence on labeled samples, and enhances the generalization ability of feature extraction models.
Smart Images

Figure CN115909336B_ABST