Voice endpoint detection method for emotion recognition, electronic equipment and storage medium

An endpoint detection and emotion recognition technology, which is applied in the fields of speech endpoint detection, electronic equipment and storage media, and emotion recognition, can solve problems such as poor emotion recognition effect, unsatisfactory detection effect, and small calculation amount, so as to improve recognition and enhance The effect of robustness and generalization ability

Active Publication Date: 2020-05-19
ONE CONNECT SMART TECH CO LTD SHENZHEN
View PDF7 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional VAD technology mainly detects based on the short-term energy, zero-crossing rate, cepstrum feature or entropy of the audio. These methods are simple in principle and

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice endpoint detection method for emotion recognition, electronic equipment and storage medium
  • Voice endpoint detection method for emotion recognition, electronic equipment and storage medium
  • Voice endpoint detection method for emotion recognition, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0047] Specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0048] The invention provides a speech endpoint detection method for emotion recognition, which is applied to an electronic device. refer to figure 1 As shown, it is a schematic diagram of an application environment of a preferred embodiment of the speech endpoint detection method for emotion recognition in the present invention.

[0049] In this embodiment, the electronic device 1 may be a server, a mobile phone, a tablet computer, a portable computer, a desktop computer, and other terminal clients with computing functions.

[0050] The electronic device 1 includes a memory 11 , a processor 12 , a network interface 13 and a communication bus 14 .

[0051] The memory 11 includes at...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to voice semantics, and provides a voice endpoint detection method for emotion recognition. The method comprises: acquiring an audio signal; performing processing operation on the audio signal, comprising adding pure noise sections and human voice noise sections in various scenes to the audio signal and randomly setting a signal-to-noise ratio; extracting an MFCC feature anda second-order difference feature of the processed audio signal; inputting the features into a neural network model, and extracting high-dimensional information of the audio signal and front and backassociated features of the audio signal; inputting the extracted high-dimensional information and the associated features of the audio signal into a full-connection network model to obtain a detectionresult of each frame of the audio signal, the detection result comprising human voice and non-human voice; and segmenting the audio signal into a human voice part and a non-human voice part accordingto the detection result of the audio signal. The invention further provides electronic equipment and a storage medium. Accurate voice endpoint detection can be realized in a low signal-to-noise ratioenvironment and a non-stationary environment.

Description

technical field [0001] The invention relates to the technical field of speech semantics, and more specifically, to a speech endpoint detection method for emotion recognition, electronic equipment and a storage medium. Background technique [0002] Before performing speech emotion recognition, it is necessary to accurately identify the endpoint position of the human voice in a long audio, so as to separate the environmental noise from the speaking voice. This technology is voice endpoint detection (VAD), which is a driving force. Speech signal processing technology. Studies have shown that if the start and end positions of speakers can be accurately identified and segmented, the accuracy of subsequent speech tasks can be effectively improved. The traditional VAD technology mainly detects based on the short-term energy, zero-crossing rate, cepstrum feature or entropy of the audio. These methods are simple in principle and have a small amount of calculation. The recognition e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L25/84G10L25/87G10L25/30G10L25/24G10L25/03G10L25/63
CPCG10L25/03G10L25/24G10L25/30G10L25/63G10L25/84G10L25/87
Inventor 王德勋徐国强
Owner ONE CONNECT SMART TECH CO LTD SHENZHEN
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products