Speech emotion recognition method

A speech emotion recognition and emotion technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of not fully considering the fuzzy characteristics of speech emotion information, and achieve the effect of solving the difficulty of selection and determination, and improving the significance

Inactive Publication Date: 2017-06-13
SOUTH CHINA UNIV OF TECH
View PDF4 Cites 60 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there is a shortcoming that only using these deep neural network models to extract features and then perform classification and recognition does not fully consider and utilize the fuzzy characteristics of speech emotional information, which is quite important in speech emotional features.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech emotion recognition method
  • Speech emotion recognition method
  • Speech emotion recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with specific examples.

[0035] The voice emotion recognition method provided in this embodiment is specifically based on a deep neural network model and feature fuzzy optimization, such as figure 1 Shown, this speech emotion recognition method comprises the following steps:

[0036] 1) Convert the speech signal into a spectrogram as the original input;

[0037] 2) Train deep convolutional neural networks (DCNNs) to automatically extract emotional features;

[0038] 3) Train a stacked autoencoder (SAE) for each type of emotion and fuse all stacked autoencoders to automatically construct the membership function of the emotional fuzzy set;

[0039] 4) The fuzzy optimization theory in step 3) is used to carry out feature optimization to the feature obtained in step 2);

[0040] 5) Use Softmax classifier for emotion classification and identification.

[0041] In step 1), the speech signal is converted...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech emotion recognition method. The method includes the steps that firstly, a speech signal is converted into a spectrogram to serve as initial input; secondly, a deep convolutional neural network is trained to automatically extract emotion features; thirdly, a stack type auto-encoder is trained for each kind of emotions, and all the stack type auto-encoders are fused to automatically construct membership functions of an emotion fuzzy set; fourthly, the features obtained in the second step are subjected to feature optimization by means of the fuzzy optimal theory in the third step; fifthly, emotion classification recognition is conducted by means of a Softmax classifier. The method takes abstract fuzzy properties of speech emotion information into consideration, the extracted emotion features are subjected to selective fuzzy optimization to improve the significance of the features, fuzzy membership functions in the fuzzy theory are automatically constructed by means of the concept of deep neural network layer-by-layer training, and the problem that the proper membership functions in the fuzzy theory are difficult to select and determine is solved.

Description

technical field [0001] The invention relates to the technical field of speech emotion recognition, in particular to a speech emotion recognition method based on a deep neural network model and feature fuzzy optimization. Background technique [0002] Human beings can express emotions through many signals, such as heartbeat frequency, voice, face, behavior, etc. Computers can identify and obtain human emotional states by analyzing one or more of these signals, among which speech is the most important and convenient way of communication in daily life. With the rapid development of the field of computer multimedia information processing technology and the field of artificial intelligence, various research institutions pay more and more attention to how to make computers recognize human voice emotions. [0003] Emotion recognition of speech belongs to the field of pattern recognition, but it is slightly different. For example, for ordinary image recognition, many small animals...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/63G10L25/30G10L25/33G10L15/06G10L15/16G10L19/20
CPCG10L25/63G10L15/063G10L15/16G10L19/20G10L25/30G10L25/33
Inventor 徐健成肖南峰
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products