Image correction and text and position identification method and system
A technology of image correction and recognition methods, applied in the field of image vision
Examples
Embodiment 1
[0084] As shown in Figures 1 and 2, step 100 is executed: the input module 200 inputs the picture to be detected.
[0085] Step 110 is executed: the detection module 210 detects the text angle of the picture to be detected, fits straight lines, computes the slope of each line, and takes the mode direction θ as the correction direction of the picture. Using the duality between points and lines, the discrete points in image space are transformed into curves in Hough space; the intersection points of these curves give the parameters of the straight-line equations, and these parameters are counted. The conversion formula is as follows:
[0086] ρ = x₁ cos θ + y₁ sin θ, where ρ is the coordinate of the pixel's representation in Hough space, x₁ is the abscissa of the pixel in image space, and y₁ is the ordinate of the pixel in image space. Count the intersection points of the curves converted to Hou...
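The point-to-curve conversion in paragraph [0086] can be illustrated with a minimal sketch. The function names `hough_rho` and `hough_curve`, the 180-step angle sampling, and the sample points are assumptions chosen for illustration; they are not taken from the patent. The sketch shows that points lying on one straight line produce curves that meet at a single (θ, ρ) pair.

```python
import math

def hough_rho(x, y, theta):
    """rho = x*cos(theta) + y*sin(theta) for one image point and one angle."""
    return x * math.cos(theta) + y * math.sin(theta)

def hough_curve(x, y, num_angles=180):
    """Sample the curve that a single image point traces in (theta, rho) space.

    num_angles is an illustrative choice: theta is sampled in 1-degree steps
    over [0, 180) degrees.
    """
    thetas = [math.pi * k / num_angles for k in range(num_angles)]
    return [(theta, hough_rho(x, y, theta)) for theta in thetas]

# Three hypothetical points on the horizontal line y = 5.
points = [(0, 5), (3, 5), (10, 5)]
curves = [hough_curve(x, y) for x, y in points]

# Index 90 corresponds to theta = 90 degrees; all three curves pass
# through rho = 5 there, which identifies the shared line.
rhos_at_90 = [curve[90][1] for curve in curves]
```

In this parameterization θ is the direction of the line's normal, so a horizontal line appears at θ = 90 degrees.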
Embodiment 2
[0093] An image rectification and text and position recognition method, comprising the following steps:
[0094] In the first step, for the input picture (as shown in Figure 3), detect the text angle of the picture, fit straight lines, compute the slope of each line, and take the mode direction as the correction direction of the picture. Using the duality between points and lines, the discrete points in image space are converted into curves in Hough space, and the intersection points of the curves are used as the parameters of the straight-line equations. The conversion equation is as follows:
[0095] ρ = x₁ cos θ + y₁ sin θ
[0096] Count the intersection points of the curves in Hough space. If the count at an intersection exceeds the threshold, that intersection is taken as a text direction and its parameters (ρ, θ) are recorded. The mode of the recorded parameters is then computed, and the resulting θ is used as the rotation angle.
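The voting, thresholding, and mode steps of the first step might look roughly like the following sketch. It is not the patent's implementation: the function name `dominant_text_angle`, the 1-degree angle bins, the unit ρ bins, and the synthetic input points are all assumptions made for illustration.

```python
import math
from collections import Counter

def dominant_text_angle(points, vote_threshold, num_angles=180, rho_step=1.0):
    """Return the mode of theta (in degrees) over Hough cells whose vote
    count exceeds vote_threshold, or None if no cell qualifies.

    Bin sizes (num_angles, rho_step) are illustrative assumptions.
    """
    votes = Counter()
    for x, y in points:
        for k in range(num_angles):
            theta = math.pi * k / num_angles
            rho = x * math.cos(theta) + y * math.sin(theta)
            # Each point votes for one (rho, theta) cell per sampled angle.
            votes[(round(rho / rho_step), k)] += 1

    # Keep only cells with enough votes; each one is a detected line.
    lines = [cell for cell, count in votes.items() if count > vote_threshold]
    if not lines:
        return None

    # The mode of theta over the detected lines is the text direction.
    angle_counts = Counter(k for _, k in lines)
    mode_k, _ = angle_counts.most_common(1)[0]
    return mode_k * 180.0 / num_angles

# Hypothetical input: three horizontal rows of "text" pixels.
rows = [(x, y) for y in (10, 30, 50) for x in range(40)]
angle = dominant_text_angle(rows, vote_threshold=30)
```

Here θ is the angle of the line's normal, so horizontal text yields 90 degrees. How θ maps to the final correction rotation depends on a convention that the excerpt does not state.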
[0097] The second step is to use the affine transformation matrix...
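The source text for the second step is truncated, so the exact matrix used by the patent is not shown. As a hedged illustration only, a rotation about the image center can be written as a 2x3 affine matrix. The function names `rotation_affine` and `apply_affine`, the choice of center, and the sign convention for the angle are assumptions.

```python
import math

def rotation_affine(angle_deg, center):
    """Build a 2x3 affine matrix for p' = R(p - c) + c.

    Assumed convention: a positive angle rotates counter-clockwise in a
    y-up coordinate system (it appears clockwise when the y axis points
    down, as in most image coordinate systems).
    """
    a = math.radians(angle_deg)
    cos_a, sin_a = math.cos(a), math.sin(a)
    cx, cy = center
    return [
        [cos_a, -sin_a, cx - cos_a * cx + sin_a * cy],
        [sin_a,  cos_a, cy - sin_a * cx - cos_a * cy],
    ]

def apply_affine(matrix, point):
    """Map one (x, y) point through a 2x3 affine matrix."""
    x, y = point
    (a, b, tx), (c, d, ty) = matrix
    return (a * x + b * y + tx, c * x + d * y + ty)

# Hypothetical usage: rotate by 90 degrees about the point (50, 20).
m = rotation_affine(90.0, center=(50.0, 20.0))
moved = apply_affine(m, (51.0, 20.0))
fixed = apply_affine(m, (50.0, 20.0))
```

The center of rotation maps to itself, and the point one unit to its right moves one unit along the y axis. Resampling the full image with such a matrix would need an interpolation step that is omitted here.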
Embodiment 3
[0116] This patent proposes an image-based system for detecting and recognizing text information and its position, an OCR (optical character recognition) system, to meet the application needs of various industries and give users a better experience. OCR is one application of image-based sequence recognition, which has long been a research topic in computer vision. OCR is the process of determining the shapes of characters and then translating those shapes into computer text through character recognition; that is, the process of scanning text material and then analyzing and processing the image file to obtain the text and layout information. To apply OCR technology more effectively across different recognition scenarios, the image to be detected is first rotated, which improves the accuracy of target-area detection in the neural network and thereby the accuracy of text detection and recognition. Pure text recognition technology cannot satisfy all OCR...