Voice interaction and voice wake-up detection method and device, equipment and storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of voice interaction and equipment, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as unfriendly users

Pending Publication Date: 2020-06-16

ALIBABA GRP HLDG LTD

View PDF19 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The current voice interaction scheme mainly considers how to improve the accuracy of voice recognition, but ignores the fact that the essence of voice interaction is to provide convenience for users, making the existing voice interaction scheme unfriendly to users

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0058] Embodiment 1, speech speed imitation

[0059] Taking the first feature referring to speech rate (for ease of distinction, it may be referred to as "first speech rate") as an example, the speech rate of the speech output to the user feedback (for ease of distinction, may be referred to as "second speech rate") ”) to be adjusted to be the same as the first speech rate, or close to the first speech rate.

[0060] Adjusting the second speech rate to be close to the first speech rate means that the adjustment can be made within a predetermined upper and lower range of the first speech rate. Wherein, the upper and lower predetermined ranges may be predetermined ratios, or predetermined numerical values. For example, suppose the first rate of speech is V 0 , can be in the interval [V 0 -V 1 , V 0 +V 2 ] to adjust the second speech rate, where, V 1 , V 2 are constants, which can be the same or different. For another example, suppose the first speaking rate is V 0 , ca...

Embodiment 2

[0062] Embodiment 2, volume imitation

[0063] Taking the volume referred to by the first feature (for ease of distinction, it may be referred to as "first volume") as an example, the volume of the voice output fed back to the user (for ease of distinction, may be referred to as "second volume") may be adjusted to The same as the first volume, or close to the first volume.

[0064] Adjusting the second volume to be close to the first volume means that the adjustment can be made within a predetermined range above and below the first volume. Wherein, the upper and lower predetermined ranges may be predetermined ratios, or predetermined numerical values. For example, assuming the first volume is C 0 , can be in the interval [C 0 -C 1 , C 0 +C 2 ] to adjust the second volume, where, C 1 、C 2 are constants, which can be the same or different. For another example, suppose the first volume is C 0 , can be in the interval [(1-α)C 0 , (1+β)C 0 ] to adjust the second volume,...

Embodiment 3

[0072] Embodiment 3, intelligently reduce the volume

[0073] As an example, the trigger condition for intelligent volume reduction may be set to satisfy one or more of the following conditions.

[0074] (1) The current time satisfies the second predetermined condition. The second predetermined condition may be a condition representing a time range, such as 9:00 p.m. to 8:00 a.m., or working hours (such as 9:00 a.m. to 5:00 p.m.) on weekdays (Monday to Friday).

[0075] (2) The device is in Do Not Disturb mode. The Do Not Disturb mode can be silent mode, vibrate mode, and so on. The ON / OFF of the Do Not Disturb mode can be actively set by the user.

[0076] (3) No voice input is received within the second predetermined time period. That is, there is no voice input within the second predetermined time period. Wherein, the value of the second predetermined duration can be set according to the actual situation, for example, it can be 10 minutes.

[0077] (4) The system volu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice interaction and voice wake-up detection method and device, equipment and a storage medium. The method comprises the steps of analyzing the voice input of a user; and adjusting parameters related to the voice interaction based on an analysis result. The method can be used for voice interaction after the equipment is awakened, for example, voice characteristics such as voice speed, volume, tone and timbre of voice input of a user can be analyzed, and the emotional state of the voice input of the user can also be analyzed. Based on the analysis result, the relatedparameters of the voice output fed back to the user can be correspondingly adjusted to improve the interaction experience of the user. The method can also be used for voice wake-up detection, under the condition that a wake-up detection result is lower than a current wake-up threshold value, the parameter of the wake-up threshold value can be lowered, for example, the wake-up threshold value can be lowered in the next period of time, and therefore the wake-up success rate in the next period of time can be increased, and the wake-up experience of a user can be improved.

Description

technical field [0001] The present invention relates to the field of voice interaction, in particular to a voice interaction and voice wake-up detection method, device, equipment and storage medium. Background technique [0002] Voice interaction belongs to the category of human-computer interaction, and it is a relatively cutting-edge interaction method that has been developed to the present. Voice interaction is the process in which users give instructions to the machine through natural language to achieve their own goals. The current voice interaction scheme mainly considers how to improve the accuracy of voice recognition, but ignores that the essence of voice interaction is to provide convenience for users, making the existing voice interaction schemes unfriendly to users. [0003] Therefore, an improved voice interaction solution is needed to provide users with a more comfortable interaction experience. Contents of the invention [0004] An object of the present in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/22G10L21/034

CPCG10L15/22G10L21/034G10L2015/223G10L2015/225

Inventor王德淼孟伟

OwnerALIBABA GRP HLDG LTD

Voice interaction and voice wake-up detection method and device, equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology