Self-learning spectrogram feature extraction method for speech emotion recognition

A technology for speech emotion recognition and feature extraction, used in speech recognition, speech analysis, instruments, etc.

Active Publication Date: 2015-11-11
SOUTHEAST UNIV
View PDF5 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the research on the correlation of speech signals often only studies one domain in the frequency domain or time domain, and there are few literatures that combine the time-frequency domain correlation of speech signals for research.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self-learning spectrogram feature extraction method for speech emotion recognition
  • Self-learning spectrogram feature extraction method for speech emotion recognition
  • Self-learning spectrogram feature extraction method for speech emotion recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be further described below in combination with specific embodiments.

[0044] The present invention provides a kind of self-learning spectrogram feature extraction method that is used for speech emotion recognition, and concrete steps are as follows:

[0045] 1) Spectrogram analysis and preprocessing

[0046] Preprocessing speech from a standard corpus of known emotions

[0047] (1) Carry out framing and windowing on the speech, and calculate the discrete Fourier transform.

[0048] X = Σ n = 0 N - 1 x ( n ) ω ( n ) e - 2 π j N ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a self-learning spectrogram feature extraction method for speech emotion recognition. The method is characterized by, to begin with, carrying out preprocessing on speech, of which the emotion is known, in a standard corpus to obtain a quantitative spectrogram gray level image; then, calculating a Gabor spectrogram of the obtained spectrogram gray level image; carrying out training on an extracted LBP statistical histogram by utilizing a recognizable characteristic learning algorithm and constructing a global significance pattern set with different scales and different directions; and finally, carrying out feature selection on the LBP statistical histograms of the Gabor spectrograms under different scales and different directions of the speech by utilizing the global significance pattern set to obtain processed statistical histograms, and cascading the N statistical histograms to obtain speech emotion characteristics suitable for emotion classification. The emotion features can recognize different types of emotions, and recognition rate thereof is substantially superior to that of existing acoustic features.

Description

technical field [0001] The invention relates to the technical field of speech emotion recognition, in particular to a feature extraction method applied to a speech emotion recognition system. Background technique [0002] Voice, as one of the most important means of communication, has received more and more attention in the new field of human-computer interaction. In order to make the human-computer interaction system and the dialogue system of the robot more intelligent and perfect, the emotional analysis of voice is becoming more and more important. important. In addition, in some long-term, monotonous, high-intensity tasks (such as spaceflight, navigation, etc.), relevant personnel often have certain negative emotions. Effective identification of these negative emotions helps to improve individual cognition and work Efficiency, prevent problems before they happen. Speech emotion recognition can also play an important role in criminal investigation and intelligent assist...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/10G10L15/20
Inventor 赵力陶华伟魏昕梁瑞宇查诚张昕然
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products