Method for speech emotion recognition by utilizing emotion perception spectrum characteristics

A speech emotion recognition and spectral feature technology, applied in speech recognition, speech analysis, instruments, etc., to achieve the effect of improving effective resolution, removing redundant emotion features, and improving emotion recognition rate

Active Publication Date: 2018-11-20
湖南商学院 +1
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, emotional information is more delicate than voice content, and traditional spectral features such as MFCC and LPC are difficult to express closer emotional states, such as: sadness, fear

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for speech emotion recognition by utilizing emotion perception spectrum characteristics
  • Method for speech emotion recognition by utilizing emotion perception spectrum characteristics
  • Method for speech emotion recognition by utilizing emotion perception spectrum characteristics

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] Below in conjunction with accompanying drawing, technical method of the present invention will be further described with specific embodiment:

[0022] see figure 1 The method for voice emotion recognition using emotion perception spectrum features provided by the embodiments of the present invention can automatically carry out the process by using computer software technical means, and specifically includes the following steps:

[0023] Step 1: Realize the preprocessing and time-frequency transformation of the speech signal: add window and divide the frame to the input speech signal first, the frame length is 1024, the frame shift is 256, and the window function is Hamming window or Hanning window. Considering the attenuation of the signal due to the stretching of the vocal tract muscles and the influence of breathing during the speech production process, it is necessary to enhance the high frequency of the speech signal. The enhancement method is to pre-emphasize each...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for speech emotion recognition by utilizing emotion perception spectrum characteristics. Firstly, a pre-emphasis method is used to perform high-frequency enhancementon an input speech signal, and then, the signal is converted into frequency field by using fast Fourier transform to obtain a speech frequency signal; the speech frequency signal is divided into a plurality of sub-bands by using an emotion perception sub-band dividing method; emotion perception spectrum characteristic calculation is carried out on each sub-band, wherein the spectrum characteristics comprises an emotion entropy characteristic, an emotional spectrum harmonic inclination degree and an emotional spectrum harmonic flatness; global statistical characteristic calculation is carried out on the spectrum characteristics to obtain a global emotion perception spectrum characteristic vector; finally, the emotion perceptual spectrum characteristic vector is input to an SVM classifier toobtain the emotion category of the speech signal. According to the speech psychoacoustic model principle, the perceptual sub-band dividing method is used to accurately describe the emotion state information, and emotion recognition is carried out through the sub-band spectrum characteristics, and the recognition rate is improved by 10.4% compared with prior MFCC characteristics.

Description

technical field [0001] The invention relates to the technical field of speech emotion recognition, in particular to a speech emotion recognition method of emotion perception spectrum features. Background technique [0002] Speech is the most important way for people to communicate. Speech signals not only contain rich semantic information, but also carry rich emotional states. Analyzing the emotional features in speech and using machine learning methods to identify the emotional state of speech can be applied in many scenarios, such as: in virtual reality, by recognizing human emotions to improve the naturalness of human-computer interaction; in car driving, Improve driving safety by identifying the driver's mental state; in medicine, provide diagnostic evidence by identifying the patient's mental state; in automatic customer service, improve customer service quality by identifying customer emotions. In recent years, with the rapid development of artificial intelligence and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/63G10L25/18G10L25/45G10L15/08
CPCG10L15/08G10L25/18G10L25/45G10L25/63
Inventor 姜林李小龙
Owner 湖南商学院
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products