Speech optimization method and device applied to intelligent robot

A voice technology applied in the field of intelligent robots. It addresses stiff, emotionless machine voices and the poor user experience they cause, improving the interactive experience and giving the voice output better rhythm.

Active Publication Date: 2020-01-14
BEIJING GUANGNIAN WUXIAN SCI & TECH
AI Technical Summary

Problems solved by technology

[0003] Existing intelligent robots can already answer users' questions or hold simple chats with users through spoken communication. However, due to technical limitations, the sounds that robots make when communicating with users are mainly machine voices, which are stiff and emotionless, so the existing human-computer interaction process gives users a poor experience.

Examples

Example 1

[0027] Figure 1 is a schematic flowchart of Example 1 of the voice optimization method applied to an intelligent robot according to an embodiment of the present invention. Each step of the method in this embodiment is described below with reference to Figure 1.

[0028] In step S110, the user's multimodal input data is acquired.

[0029] It should be noted that the multimodal input data mainly includes audio data, video data, image data, and program instructions that cause the robot to output certain actions or to run software or hardware. Combinations of multimodal input data are relatively complex; by analyzing the multimodal input data, reliable or meaningful results can be obtained and the true intention of the sender of the multimodal data can be determined.
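The kinds of input listed in paragraph [0029] can be pictured as one record per user turn. The following is a minimal sketch only: the class name `MultimodalInput`, its field names, and the `modalities` helper are hypothetical and do not come from the patent text.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class MultimodalInput:
    """Hypothetical container for one user turn (names are illustrative)."""

    audio: Optional[bytes] = None   # e.g. samples captured by a microphone
    video: Optional[bytes] = None   # e.g. frames captured by a camera
    image: Optional[bytes] = None   # e.g. a single still picture
    # Program instructions that make the robot act or run software/hardware.
    instructions: List[str] = field(default_factory=list)

    def modalities(self) -> List[str]:
        """Return the names of the modalities present in this input."""
        present = []
        if self.audio is not None:
            present.append("audio")
        if self.video is not None:
            present.append("video")
        if self.image is not None:
            present.append("image")
        if self.instructions:
            present.append("instructions")
        return present


turn = MultimodalInput(audio=b"\x00\x01", instructions=["wave_hand"])
print(turn.modalities())
```

Keeping every modality optional reflects the point that the combination of inputs varies from turn to turn; any intent analysis would start from whichever modalities are present.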

[0030] In this example, the multimodal input data can be acquired through the image acquisition system (such as a camera) and voice input system (such as a microphone) of the intelligent robot. For example, when the user in...

Example 2

[0054] In addition, the present invention provides a further embodiment. Figure 2 is a schematic flowchart of Example 2 of the speech optimization method applied to an intelligent robot according to the present invention.

[0055] Steps S110, S120 and S130 of the method in this embodiment are similar to the first three steps of the first embodiment; the difference from the first embodiment lies in step S140'. Steps that are the same as in Figure 1 are denoted by the same symbols in this example and are not described again. Only the difference between the two, step S140', is described.

[0056] In step S140', when the set playback time of the media file is reached, the corresponding media file and the TTS voice that the TTS system generates for the response message are output according to the set rules.

[0057] In this embodiment, the playback time of the media file is preset; for example, the media file is set to play 3 seconds after the TTS voice is played. For exam...
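The preset playback time in step S140' can be sketched as a small schedule of playback events. This is an illustration under stated assumptions: `PlaybackEvent`, `build_schedule`, and the file names are hypothetical, and the 3-second delay is measured here from the start of TTS playback, which the text does not specify.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class PlaybackEvent:
    start_s: float  # start time in seconds, relative to the start of the turn
    source: str     # "tts" or "media"
    name: str       # hypothetical clip identifier


def build_schedule(tts_name: str, media_name: str,
                   media_delay_s: float = 3.0) -> List[PlaybackEvent]:
    """Order the TTS voice and the media file by their start times.

    Assumption: the delay counts from the moment TTS playback begins.
    """
    if media_delay_s < 0:
        raise ValueError("media_delay_s must be non-negative")
    events = [
        PlaybackEvent(0.0, "tts", tts_name),
        PlaybackEvent(media_delay_s, "media", media_name),
    ]
    # sorted() is stable, so the TTS event stays first when the delay is 0.
    return sorted(events, key=lambda e: e.start_s)


for event in build_schedule("reply.wav", "laugh.wav"):
    print(event.start_s, event.source, event.name)
```

A player would walk this list and start each clip when its `start_s` is reached; changing `media_delay_s` models a different preset rule.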

Example 3

[0060] Figure 3 is a structural block diagram of an embodiment of a speech optimization device 200 applied to an intelligent robot according to the present invention. As shown in Figure 3, the device includes a multimodal input unit 210, a response unit 220, an analysis unit 230 and a voice output unit 240. The components of the device are explained below with reference to Figure 3.

[0061] The multimodal input unit 210 is used for acquiring multimodal input data of the user.

[0062] In this example, the multimodal input unit 210 may be the image acquisition system (such as a camera) and the voice input system (such as a microphone) of the intelligent robot, through which the multimodal input data is acquired. For example, when the user interacts with the robot by voice, the user sends voice information to the robot, and the unknown voice signal is converted into an electrical signal by a voice signal acquisition device such as a microphone, and then input t...
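The four units of device 200 can be pictured as stages wired in sequence. The sketch below is an assumption-laden illustration: only the unit names and reference numerals come from the text, while the callable signatures, the `handle_turn` method, and the role given to each of units 220 to 240 are inferred from the abstract rather than stated in this section.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class SpeechOptimizationDevice:
    """Hypothetical wiring of units 210-240; all signatures are assumptions."""

    multimodal_input_unit: Callable[[], str]                     # unit 210
    response_unit: Callable[[str], str]                          # unit 220
    analysis_unit: Callable[[str], Tuple[Optional[str], str]]    # unit 230
    voice_output_unit: Callable[[Optional[str], str], str]       # unit 240

    def handle_turn(self) -> str:
        """Run one interaction turn through the four units in order."""
        user_input = self.multimodal_input_unit()
        text = self.response_unit(user_input)
        media_file, response = self.analysis_unit(text)
        return self.voice_output_unit(media_file, response)


# Stub units stand in for a real camera, microphone, and TTS system.
device = SpeechOptimizationDevice(
    multimodal_input_unit=lambda: "hello",
    response_unit=lambda data: "reply to " + data,
    analysis_unit=lambda text: ("chime.wav", text),
    voice_output_unit=lambda media, resp: f"{media}|{resp}",
)
print(device.handle_turn())
```

Passing the units in as callables keeps each stage replaceable, which matches the block-diagram view of the device without committing to any particular hardware or TTS engine.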

Abstract

The invention discloses a voice optimization method and a voice optimization device applied to an intelligent robot. The voice optimization method comprises the steps of: acquiring multi-modal input data of a user; generating text information in response to the multi-modal input data; performing text analysis on the text information when determining that a set triggering rule is satisfied, and querying a corresponding media file and response information according to an analysis result; and outputting the media file and TTS voice generated by a TTS system in response to the response information according to a set rule. By outputting the media file and the TTS voice in a combined manner, the voice output of the robot has more features of a human language, the rhythm is good, the user feels comfortable, the capability of the robot is enhanced, and the interaction demand of users is satisfied.
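The analysis step in the abstract (check a triggering rule, then look up a media file and the response information) can be sketched as follows. Everything concrete here is hypothetical: the keyword-based rule, the `MEDIA_LIBRARY` table, the file names, and the `analyze` function are illustrative stand-ins, since the abstract does not say how the rule or the lookup is defined.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical mapping from a trigger word to a pre-recorded media file.
MEDIA_LIBRARY: Dict[str, str] = {
    "haha": "laugh.wav",
    "hmm": "thinking.wav",
}


def analyze(text: str) -> Tuple[Optional[str], str]:
    """Return (media_file, response_text) for generated reply text.

    Assumed triggering rule: the text contains a word from MEDIA_LIBRARY.
    When it fires, the trigger word is replaced by the media file and the
    remaining words are left for the TTS system to speak.
    """
    tokens: List[str] = text.split()
    normalized = [t.strip(".,!?").lower() for t in tokens]
    for keyword, media_file in MEDIA_LIBRARY.items():
        if keyword in normalized:
            kept = [t for t, n in zip(tokens, normalized) if n != keyword]
            return media_file, " ".join(kept)
    # Rule not satisfied: no media file, speak the whole text via TTS.
    return None, text


print(analyze("Haha, that is funny"))
print(analyze("Hello there"))
```

The returned pair is what an output stage would combine: the media file supplies the human-sounding fragment, and the response text goes to TTS.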

Description

Technical field

[0001] The invention relates to the field of intelligent robots, in particular to a voice optimization method and device applied to intelligent robots.

Background

[0002] With the gradual popularization of intelligent robot products, more intelligent robots have entered the home, becoming children's playmates and adults' housekeepers.

[0003] Existing intelligent robots can already answer users' questions or hold simple chats with users through spoken communication. However, due to technical limitations, the voices of robots when communicating with users are still mainly machine voices, which are relatively stiff and emotionless, so the existing human-computer interaction process gives users a poor experience.

[0004] Therefore, there is an urgent need to provide a solution that can optimize the sound experience, make the user interacting with the robot feel comfortable, improve the interaction ability of the intelligent robot, a...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/22, G10L25/63
CPC: G10L15/22, G10L25/63, G10L2015/225
Inventor: 谢文静
Owner: BEIJING GUANGNIAN WUXIAN SCI & TECH