Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech emotion recognition method based on variational modal decomposition and extreme learning machine

A technology of variational mode decomposition and speech emotion recognition, which is applied in speech analysis, instruments, etc., can solve problems such as poor performance, and achieve the effect of good comprehensive performance and high recognition rate

Inactive Publication Date: 2018-09-28
NANJING UNIV OF INFORMATION SCI & TECH
View PDF2 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The existing speech emotion features still have the problem of poor performance in classification and recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech emotion recognition method based on variational modal decomposition and extreme learning machine
  • Speech emotion recognition method based on variational modal decomposition and extreme learning machine
  • Speech emotion recognition method based on variational modal decomposition and extreme learning machine

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

[0038] The present embodiment adopts two kinds of speech emotion data sets (EMODB, RAVDESS) shared anger, sad, fear, happy, neutral five kinds of emotion each 50 sentences. Among them, 40 sentences are randomly selected for training, 10 sentences are used for testing, and 10 experiments are carried out. The average recognition rate of the 10 experiments is used as the evaluation index for the experimental results. The variational mode decomposition (Variational Mode Decomposition, VMD) algorithm is combined with the extreme learning machine (Extreme Learning Machine, ELM) classification algorithm for speech emotion classification and recognition. The flow chart of the speech emotion classification and recognition method based on variational mode dec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech emotion recognition method based on variational modal decomposition and an extreme learning machine, and belongs to the field of artificial intelligence and speech recognition. The speech emotion recognition method first preprocesses an emotional speech signal through a variational modal decomposition method, the emotional speech signal is decomposed into a plurality of intrinsic mode function (IMF) components and a residual component, and the components can reflect the change of a original sequence more accurately and retain the emotional characteristics of the speech signals; then, the IMF components are subjected to hilbert conversion to obtain hillbert marginal spectrum characteristics of the IMF components; and in addition, the IMF components are reaggregated to obtain the speech signal removing the residual component, and then an MEL cepstrum function is extracted for the signal. The extracted new characteristics are added into a traditional speech emotional characteristic set, and an extreme learning machine model is constructed for classification and recognition. The speech emotion recognition method has the advantage of obtaining new speechcharacteristics through the variational modal decomposition. Compared with the traditional speech emotional characteristics, the characteristics have a higher recognition rate for speech emotion recognition.

Description

technical field [0001] The invention relates to the fields of artificial intelligence and speech recognition, in particular to a speech emotion recognition method based on variational mode decomposition and extreme learning machine. Background technique [0002] Among the many communication methods, voice signals are the fastest natural method for human-to-human and human-to-machine communication. Humans can even sense the speaker's emotional state from spoken communication. Speech emotion is a method of analyzing vocal behavior and refers to indicators of various influences (such as mood, emotion, and stress), focusing on the nonverbal aspects of speech. In this context, the main challenge of speech emotion recognition (SER) is to extract some objective and measurable speech feature parameters that can reflect the speaker's emotional state. In recent years, speech emotion recognition has received extensive attention in the fields of human-computer communication, robot com...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L25/63G10L25/24G10L25/21G10L25/18
CPCG10L25/18G10L25/21G10L25/24G10L25/63
Inventor 张秀再王玮蔚赵慧
Owner NANJING UNIV OF INFORMATION SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products