Book point reading method and system based on deep learning
A deep learning and book point technology, applied in the field of book point reading, can solve the problems of cumbersome operation, fixed image and text recognition and poor accuracy, so as to improve accuracy, simplify point reading detection and recognition methods, and avoid point reading errors. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0028] The book point reading method based on deep learning provided by the embodiment of the present invention can be applied to the field of text detection and recognition, such as figure 1 As shown, including the following steps:
[0029] Step S11: Obtain a finger point reading image taken by the image acquisition device.
[0030] Finger point reading recognition is divided into finger detection and text detection. Traditional finger detection requires gestures to draw a rectangle in the text area to determine the point reading area, and recognize the rectangular area based on algorithms such as skin color segmentation, and finally detect the text in the rectangular area. However, in the embodiment of the present invention, the image acquisition device is used to capture the text that needs to be clicked, without using a point-reading pen, only a finger is needed to point to the area that needs to be clicked, and it is simpler than the traditional finger swipe area detection.
[0...
Embodiment 2
[0071] The embodiment of the present invention provides a book point reading system based on deep learning, such as Image 6 Shown, including:
[0072] The image acquisition module 1 acquires the finger point reading image taken by the image acquisition device; this module executes the method described in step S1 in the embodiment 1, which will not be repeated here.
[0073] Fingertip position and text detection module 2, which inputs the finger point reading image into the finger detection model and text detection model at the same time, respectively detects the fingertip position and all text areas in the image; this module executes the step S2 described in embodiment 1 The method will not be repeated here.
[0074] The text area cutting module 3 combines the detected fingertip position and the text area to perform affine transformation on the text area to cut the text area; this module executes the method described in step S3 in embodiment 1, and will not be repeated here. .
[00...
Embodiment 3
[0080] The embodiment of the present invention provides a computer device, such as Figure 7 As shown, it includes: at least one processor 401, such as a CPU (Central Processing Unit, central processing unit), at least one communication interface 403, memory 404, and at least one communication bus 402. Among them, the communication bus 402 is used to implement connection and communication between these components. The communication interface 403 may include a display (Display) and a keyboard (Keyboard), and the optional communication interface 403 may also include a standard wired interface and a wireless interface. The memory 404 may be a high-speed RAM memory (Ramdom Access Memory, volatile random access memory), or a non-volatile memory (non-volatile memory), such as at least one disk memory. Optionally, the memory 404 may also be at least one storage device located far away from the aforementioned processor 401. The processor 401 can execute the book point reading method b...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com