Speech emotion recognition method

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech emotion recognition and emotion technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of not fully considering the fuzzy characteristics of speech emotion information, and achieve the effect of solving the difficulty of selection and determination, and improving the significance

Inactive Publication Date: 2017-06-13

SOUTH CHINA UNIV OF TECH

View PDF4 Cites 60 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, there is a shortcoming that only using these deep neural network models to extract features and then perform classification and recognition does not fully consider and utilize the fuzzy characteristics of speech emotional information, which is quite important in speech emotional features.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with specific examples.

[0035] The voice emotion recognition method provided in this embodiment is specifically based on a deep neural network model and feature fuzzy optimization, such as figure 1 Shown, this speech emotion recognition method comprises the following steps:

[0036] 1) Convert the speech signal into a spectrogram as the original input;

[0037] 2) Train deep convolutional neural networks (DCNNs) to automatically extract emotional features;

[0038] 3) Train a stacked autoencoder (SAE) for each type of emotion and fuse all stacked autoencoders to automatically construct the membership function of the emotional fuzzy set;

[0039] 4) The fuzzy optimization theory in step 3) is used to carry out feature optimization to the feature obtained in step 2);

[0040] 5) Use Softmax classifier for emotion classification and identification.

[0041] In step 1), the speech signal is converted...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech emotion recognition method. The method includes the steps that firstly, a speech signal is converted into a spectrogram to serve as initial input; secondly, a deep convolutional neural network is trained to automatically extract emotion features; thirdly, a stack type auto-encoder is trained for each kind of emotions, and all the stack type auto-encoders are fused to automatically construct membership functions of an emotion fuzzy set; fourthly, the features obtained in the second step are subjected to feature optimization by means of the fuzzy optimal theory in the third step; fifthly, emotion classification recognition is conducted by means of a Softmax classifier. The method takes abstract fuzzy properties of speech emotion information into consideration, the extracted emotion features are subjected to selective fuzzy optimization to improve the significance of the features, fuzzy membership functions in the fuzzy theory are automatically constructed by means of the concept of deep neural network layer-by-layer training, and the problem that the proper membership functions in the fuzzy theory are difficult to select and determine is solved.

Description

technical field [0001] The invention relates to the technical field of speech emotion recognition, in particular to a speech emotion recognition method based on a deep neural network model and feature fuzzy optimization. Background technique [0002] Human beings can express emotions through many signals, such as heartbeat frequency, voice, face, behavior, etc. Computers can identify and obtain human emotional states by analyzing one or more of these signals, among which speech is the most important and convenient way of communication in daily life. With the rapid development of the field of computer multimedia information processing technology and the field of artificial intelligence, various research institutions pay more and more attention to how to make computers recognize human voice emotions. [0003] Emotion recognition of speech belongs to the field of pattern recognition, but it is slightly different. For example, for ordinary image recognition, many small animals...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L25/63G10L25/30G10L25/33G10L15/06G10L15/16G10L19/20

CPCG10L25/63G10L15/063G10L15/16G10L19/20G10L25/30G10L25/33

Inventor徐健成肖南峰

OwnerSOUTH CHINA UNIV OF TECH

Speech emotion recognition method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology