Speech optimization method and device applied to intelligent robot
An intelligent robot and voice technology, applied in the field of intelligent robots, can solve the problems of poor user experience, lack of emotion, strong voice, etc., and achieve the effect of improving interactive experience and good rhythm
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 example
[0027] figure 1 It is a schematic flowchart of Example 1 of the voice optimization method applied to an intelligent robot according to an embodiment of the present invention. Refer below figure 1 Each step of the method in this embodiment will be described.
[0028] In step S110, the user's multimodal input data is acquired.
[0029] It should be noted that the multimodal input data mainly includes audio data, video data, image data, and program instructions for enabling the robot to output certain actions or execute software or hardware. The combination of multimodal input data is relatively complex, and reliable or meaningful results can be obtained by analyzing the multimodal input data to determine the true intention of the multimodal data sender.
[0030] In this example, the multimodal input data can be acquired through the image acquisition system (such as a camera) and voice input system (such as a microphone) of the intelligent robot. For example, when the user in...
no. 2 example
[0054] In addition, the present invention also provides an embodiment, figure 2 It is a schematic flow chart of Example 2 of the speech optimization method applied to intelligent robots according to the present invention.
[0055] Steps S110, S120 and S130 of the method in this embodiment are similar to the first three steps of the first embodiment, and the difference from the first embodiment lies in step S140'. and figure 1 The same steps are represented by the same symbols in this example, and will not be repeated, only the difference between the two - step S140' will be described.
[0056] In step S140', when the set play time of the media file is satisfied, the corresponding media file and the TTS voice of the response message generated by the TTS system are output according to the set rules.
[0057] In this embodiment, the playing time of playing the media file is preset, for example, it is set to play the media file 3 seconds after the TTS voice is played. For exam...
no. 3 example
[0060] image 3 It is a structural block diagram of an embodiment of a speech optimization device 200 applied to an intelligent robot according to the present invention. like image 3 As shown, the device includes: a multimodal input unit 210 , a response unit 220 , an analysis unit 230 and a voice output unit 240 . Refer below image 3 To explain the various components of this device.
[0061] The multimodal input unit 210 is used for acquiring multimodal input data of the user.
[0062] In this example, the multimodal input unit 210 may be an image acquisition system (such as a camera) and a voice input system (such as a microphone) of the intelligent robot, through which multimodal input data is acquired. For example, when the user interacts with the robot by voice, the user sends voice information to the robot, and the unknown voice signal is converted into an electrical signal by a voice signal acquisition device such as a microphone and a microphone, and then input t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com