Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Method for Speech Emotion Recognition Using Emotion Perceptual Spectrum Features

A speech emotion recognition and spectral feature technology, applied in speech recognition, speech analysis, instruments, etc., to remove redundant emotion features, improve emotion recognition rate, and improve effective resolution.

Active Publication Date: 2022-05-17
湖南商学院 +1
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, emotional information is more delicate than voice content, and traditional spectral features such as MFCC and LPC are difficult to express closer emotional states, such as: sadness, fear

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method for Speech Emotion Recognition Using Emotion Perceptual Spectrum Features
  • A Method for Speech Emotion Recognition Using Emotion Perceptual Spectrum Features
  • A Method for Speech Emotion Recognition Using Emotion Perceptual Spectrum Features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] Below in conjunction with accompanying drawing, technical method of the present invention will be further described with specific embodiment:

[0022] see figure 1 The method for voice emotion recognition using emotion perception spectrum features provided by the embodiments of the present invention can automatically carry out the process by using computer software technical means, and specifically includes the following steps:

[0023] Step 1: Realize the preprocessing and time-frequency transformation of the speech signal: add window and divide the frame to the input speech signal first, the frame length is 1024, the frame shift is 256, and the window function is Hamming window or Hanning window. Considering the attenuation of the signal due to the stretching of the vocal tract muscles and the influence of breathing during the speech production process, it is necessary to enhance the high frequency of the speech signal. The enhancement method is to pre-emphasize each...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method for speech emotion recognition using emotion perception spectrum features. Firstly, a pre-emphasis method is used to perform high-frequency enhancement on the input speech signal, and then the fast Fourier transform is used to convert it to a frequency to obtain a speech frequency signal; and then for the speech frequency The signal is divided into multiple sub-bands by using the emotion-aware sub-band division method; the emotional-aware spectral features are calculated for each sub-band, and the spectral features include emotional entropy features, emotional spectrum harmonic inclinations, and emotional spectrum harmonic flatness; The spectral features are calculated by global statistical features to obtain the global emotion perception spectrum feature vector; finally, the emotion perception spectrum feature vector is input to the SVM classifier to obtain the emotion category of the speech signal. According to the principle of the speech psychoacoustic model, the invention adopts the perceptual sub-band division method to accurately describe the emotional state information, and performs emotional recognition through the sub-band spectral features, and improves the recognition rate by 10.4% compared with the traditional MFCC features.

Description

technical field [0001] The invention relates to the technical field of speech emotion recognition, in particular to a speech emotion recognition method of emotion perception spectrum features. Background technique [0002] Speech is the most important way for people to communicate. Speech signals not only contain rich semantic information, but also carry rich emotional states. Analyzing the emotional features in speech and using machine learning methods to identify the emotional state of speech can be applied in many scenarios, such as: in virtual reality, by recognizing human emotions to improve the naturalness of human-computer interaction; in car driving, Improve driving safety by identifying the driver's mental state; in medicine, provide diagnostic evidence by identifying the patient's mental state; in automatic customer service, improve customer service quality by identifying customer emotions. In recent years, with the rapid development of artificial intelligence and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/63G10L25/18G10L25/45G10L15/08
CPCG10L15/08G10L25/18G10L25/45G10L25/63
Inventor 姜林李小龙
Owner 湖南商学院
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products