Dictation interaction method, system and device based on AI vision
An interactive method and dictation technology, applied in the field of artificial intelligence recognition interaction, can solve problems such as cumbersome operation, low efficiency, and slow recognition speed of dictation, and achieve the effect of enhancing user experience
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach 1
[0073] refer to Figure 1-2 As shown, an AI vision-based dictation interaction method proposed in this embodiment includes the following steps.
[0074] Step S100: Obtain in real time the collected target image including identifiable motion information and text information.
[0075] In this step S100, an image acquisition device is used to collect target images of the user within the field of view to perform non-contact human-computer interaction. The collection device may be a camera device or an image sensor device. The acquisition device collects high-definition current images of the pre-detection area in real time (the pre-detection area can be understood as the field of view). In one embodiment, a camera device is used to capture high-definition images in real time.
[0076] Step S200: Construct and train a plurality of convolutional deep neural networks and cyclic deep neural networks, or a combined structure of Transformer deep neural networks based on the self-atten...
Embodiment approach 2
[0124] Based on the above-mentioned dictation interaction method based on AI vision, this embodiment provides a specific solution, refer to the attached Figure 5 As shown, this embodiment provides a dictation interaction system based on AI vision.
[0125] The dictation interactive system based on AI vision includes an acquisition module 100, an identification module 200, a processing module 300, a voice module 400, and a display module 500; the identification module 200 is connected to the acquisition module 100 and the processing module 300, and the processing module 30 is connected to the display module 500, Voice module 400 is connected.
[0126] The acquisition module 100 is configured to receive in real time the acquired target image including identifiable motion information and text information.
[0127] The recognition module 200 is used to construct and train a plurality of convolutional deep neural networks and cyclic deep neural networks, or a combined structure o...
Embodiment approach 3
[0132] Based on the above-mentioned dictation interaction method based on AI vision, this embodiment provides another specific solution, refer to Figure 6-7 As shown, this embodiment provides a dictation interaction device based on AI vision. The device includes an AI recognition device 10 and an output device 20. The AI recognition device 10 includes a camera device 11, a recognition device 12, a processing device 13, and an output device. 20 includes a display device 21 and a voice device 22 , the recognition device 12 is connected to the camera device 11 and the processing device 13 respectively, and the processing device 13 is connected to the display device 21 and the voice device 22 . Reference attached Figure 6 As shown, the display device 21 and the voice device 22 in this embodiment may use peripheral devices. The device can be designed as an integrated dictation interactive device, such as Figure 6 , can also be designed as a combined dictation interaction dev...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com