Dictation interaction method, system and device based on AI vision

An interactive method and dictation technology, applied in the field of artificial intelligence recognition interaction, can solve problems such as cumbersome operation, low efficiency, and slow recognition speed of dictation, and achieve the effect of enhancing user experience

Pending Publication Date: 2020-11-27
上海翎腾智能科技有限公司
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] This application provides a dictation interaction method, system, and device based on AI vision, which are configured to solve the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dictation interaction method, system and device based on AI vision
  • Dictation interaction method, system and device based on AI vision
  • Dictation interaction method, system and device based on AI vision

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach 1

[0073] refer to Figure 1-2 As shown, an AI vision-based dictation interaction method proposed in this embodiment includes the following steps.

[0074] Step S100: Obtain in real time the collected target image including identifiable motion information and text information.

[0075] In this step S100, an image acquisition device is used to collect target images of the user within the field of view to perform non-contact human-computer interaction. The collection device may be a camera device or an image sensor device. The acquisition device collects high-definition current images of the pre-detection area in real time (the pre-detection area can be understood as the field of view). In one embodiment, a camera device is used to capture high-definition images in real time.

[0076] Step S200: Construct and train a plurality of convolutional deep neural networks and cyclic deep neural networks, or a combined structure of Transformer deep neural networks based on the self-atten...

Embodiment approach 2

[0124] Based on the above-mentioned dictation interaction method based on AI vision, this embodiment provides a specific solution, refer to the attached Figure 5 As shown, this embodiment provides a dictation interaction system based on AI vision.

[0125] The dictation interactive system based on AI vision includes an acquisition module 100, an identification module 200, a processing module 300, a voice module 400, and a display module 500; the identification module 200 is connected to the acquisition module 100 and the processing module 300, and the processing module 30 is connected to the display module 500, Voice module 400 is connected.

[0126] The acquisition module 100 is configured to receive in real time the acquired target image including identifiable motion information and text information.

[0127] The recognition module 200 is used to construct and train a plurality of convolutional deep neural networks and cyclic deep neural networks, or a combined structure o...

Embodiment approach 3

[0132] Based on the above-mentioned dictation interaction method based on AI vision, this embodiment provides another specific solution, refer to Figure 6-7 As shown, this embodiment provides a dictation interaction device based on AI vision. The device includes an AI recognition device 10 and an output device 20. The AI ​​recognition device 10 includes a camera device 11, a recognition device 12, a processing device 13, and an output device. 20 includes a display device 21 and a voice device 22 , the recognition device 12 is connected to the camera device 11 and the processing device 13 respectively, and the processing device 13 is connected to the display device 21 and the voice device 22 . Reference attached Figure 6 As shown, the display device 21 and the voice device 22 in this embodiment may use peripheral devices. The device can be designed as an integrated dictation interactive device, such as Figure 6 , can also be designed as a combined dictation interaction dev...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a dictation interaction method, system and device based on AI vision. The method comprises the steps of: S100, obtaining a collected target image in real time; S200: constructing and training a plurality of convolutional deep neural networks and cyclic deep neural networks, or based on a Transformer deep neural network combined structure of a self-attention mechanism, carrying out comprehensive weighting calculation on a plurality of combined structure output results for handwritten font recognition by utilizing a dynamically planned common substring matching algorithm,and recognizing action information and text information in the target image; S300, according to the identified action information, executing a dictation control task or a dictation control task; S400,controlling to play dictation content of the dictation task; and S500, controlling and displaying prompt content and a dictation result in the dictation task. Through multiple convolutional deep neural networks, interaction between gestures and dictation equipment is realized, the recognition accuracy is improved, the recognition speed is increased, and the use experience of a user is enhanced.

Description

technical field [0001] The invention relates to the field of artificial intelligence recognition interaction, in particular to a dictation interaction method, system, and device based on AI vision. Background technique [0002] Text dictation in language learning is an important link in the learning process. Existing tools require manual input of the content to be dictated, or manual dictation, and the dictation content needs to be prepared in advance, so the effect is low. [0003] The development of deep learning and big data has greatly improved the performance of artificial intelligence methods in image recognition, gesture recognition and text recognition. Applying techniques such as gesture recognition and character recognition to dictation in language learning through artificial intelligence can greatly improve people's language learning efficiency. [0004] In the prior art, an artificial intelligence-based method for assisted reading of children's picture books inc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/34G06K9/00G06K9/20G06N3/04G06N3/08
CPCG06N3/08G06V40/20G06V10/225G06V30/153G06N3/045
Inventor 高旻昱范骁骏侯瑞
Owner 上海翎腾智能科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products