Spectrogram feature extraction method for speech emotion recognition

A speech emotion recognition and feature extraction technology, applied in speech analysis, instruments, etc. It addresses the lack of speech emotion recognition algorithms based on spectrogram features, and it reduces differences, improves discrimination, and lowers error rates.

Status: Inactive
Publication Date: 2015-05-20
NANJING INST OF TECH

AI Technical Summary

Problems solved by technology

Spectrogram-based research has mainly addressed sound classification, sound recognition, and sound enhancement; speech emotion recognition algorithms based on spectrogram features have not been studied.


Embodiment Construction

[0037] The present invention will be further described below in conjunction with the accompanying drawings. The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0038] As shown in Figure 1, a spectrogram feature extraction method for speech emotion recognition includes the following steps:

[0039] Step 1: the speech signal is divided into frames, and a fast Fourier transform is applied to each frame to obtain the corresponding spectrogram.
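Step 1 can be illustrated with a minimal sketch. The source does not give the frame length, frame shift, window function, or amplitude scale, so the values below (256-sample frames, 128-sample hop, Hamming window, log magnitude in dB) and the function name `spectrogram` are assumptions chosen only for illustration.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Frame a 1-D signal and return the log-magnitude spectrum of each frame.

    frame_len, hop, the Hamming window and the dB scale are illustrative
    assumptions; the source does not specify them.
    """
    signal = np.asarray(signal, dtype=float)
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hamming(frame_len)
    # One windowed frame per row.
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # FFT of each frame; rfft keeps only the non-negative frequencies.
    spectrum = np.fft.rfft(frames, axis=1)
    # Small constant avoids log(0) for silent bins.
    return 20.0 * np.log10(np.abs(spectrum) + 1e-10)

# Usage: one second of a 1 kHz tone sampled at 8 kHz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)
spec = spectrogram(tone)
print(spec.shape)  # (61, 129): 61 frames, 129 frequency bins
```

Each row of `spec` is one frame and each column one frequency bin, so the array can be treated as a grayscale image in the later steps.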

[0040] Step 2: the spectrogram is decomposed.

[0041] The image is convolved with a linearly separable Gaussian kernel, and the different channels are decomposed at different scales to obtain multi-channel, multi-scale decomposed images; the channels here are the color, brightness, and orientation channels.
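The multi-scale part of this decomposition can be sketched for a single channel. The sketch assumes a Gaussian pyramid in which each level is blurred with a separable kernel (one 1-D pass along rows, one along columns) and then downsampled by 2; the source's own relationship between scales is truncated, so this scheme, the kernel size, `sigma`, and all function names are hypothetical. The color and orientation channels are not covered.

```python
import numpy as np

def gaussian_kernel_1d(sigma=1.0, radius=2):
    """Normalized 1-D Gaussian kernel (sigma and radius are assumed values)."""
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return kernel / kernel.sum()

def separable_blur(image, kernel):
    """Blur a 2-D image with a separable kernel: rows first, then columns."""
    pad = len(kernel) // 2
    padded = np.pad(image, pad, mode="edge")
    rows = np.apply_along_axis(
        lambda r: np.convolve(r, kernel, mode="valid"), 1, padded)
    return np.apply_along_axis(
        lambda c: np.convolve(c, kernel, mode="valid"), 0, rows)

def gaussian_pyramid(image, levels=4, sigma=1.0):
    """Return a list of progressively blurred and 2x-downsampled images."""
    kernel = gaussian_kernel_1d(sigma)
    pyramid = [np.asarray(image, dtype=float)]
    for _ in range(levels - 1):
        blurred = separable_blur(pyramid[-1], kernel)
        pyramid.append(blurred[::2, ::2])  # keep every second row and column
    return pyramid

# Usage: a constant image stays constant at every scale.
image = np.full((32, 32), 5.0)
pyramid = gaussian_pyramid(image, levels=4)
print([level.shape for level in pyramid])  # [(32, 32), (16, 16), (8, 8), (4, 4)]
```

Separability is what makes the kernel cheap to apply: two 1-D convolutions replace one 2-D convolution.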

[0042] The relationship between decomposed images on different scales of the same channel is P(σ)=...


Abstract

The invention discloses a spectrogram feature extraction method for speech emotion recognition. The method comprises five steps: step 1, framing the speech signal and applying the fast Fourier transform to obtain the corresponding spectrogram; step 2, decomposing the spectrogram; step 3, applying center-surround subtraction to the decomposed images and normalizing the result to obtain a feature map for each decomposed image; step 4, extracting the feature matrix of each feature map; and step 5, reducing the dimensionality of the feature matrix and reconstructing it. The method analyzes spectrogram features with image-processing techniques and so approaches emotion feature extraction from a new angle: it decomposes the spectrogram with a multi-scale, multi-channel filter, processes it in different feature domains, and combines the result with PCA to better extract information useful for speech emotion recognition.
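Steps 3 and 5 can be sketched under stated assumptions. The source does not specify how center-surround subtraction, normalization, or the final reconstruction are computed, so the choices below are hypothetical: the surround map is upsampled by pixel repetition, normalization is min-max scaling to [0, 1], and "reconstruction" is read as back-projection from the PCA subspace. All function names are illustrative.

```python
import numpy as np

def center_surround(center, surround):
    """Absolute difference between a fine map and an upsampled coarse map.

    Assumes the center shape is an integer multiple of the surround shape.
    """
    fy = center.shape[0] // surround.shape[0]
    fx = center.shape[1] // surround.shape[1]
    upsampled = np.repeat(np.repeat(surround, fy, axis=0), fx, axis=1)
    return np.abs(center - upsampled)

def normalize(feature_map):
    """Min-max scale a map to [0, 1] (an assumed normalization)."""
    lo, hi = feature_map.min(), feature_map.max()
    if hi == lo:
        return np.zeros_like(feature_map)
    return (feature_map - lo) / (hi - lo)

def pca_reduce(features, n_components):
    """Project rows onto the top principal components and back-project."""
    mean = features.mean(axis=0)
    centered = features - mean
    # Rows of vt are the principal directions, ordered by variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    reduced = centered @ components.T
    reconstructed = reduced @ components + mean
    return reduced, reconstructed

# Usage with random stand-in data.
rng = np.random.default_rng(0)
center = rng.random((8, 8))
surround = center.reshape(4, 2, 4, 2).mean(axis=(1, 3))  # 2x coarser map
feature_map = normalize(center_surround(center, surround))

features = rng.random((20, 6))  # 20 samples, 6 features each
reduced, reconstructed = pca_reduce(features, n_components=3)
print(feature_map.shape, reduced.shape, reconstructed.shape)
```

Keeping fewer components than features discards the low-variance directions, so `reconstructed` is an approximation of `features` rather than an exact copy.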

Description

Technical field

[0001] The invention relates to a spectrogram feature extraction method for speech emotion recognition and belongs to the technical field of speech emotion recognition.

Background

[0002] With the development of human-computer interaction technology, speech emotion recognition has become one of its key technologies. To make human-computer interaction systems and robot dialogue systems more intelligent and complete, emotion analysis of speech is becoming increasingly important. In addition, in some long, monotonous, high-intensity tasks (such as spaceflight and navigation), the personnel involved often develop negative emotions. Identifying these emotions effectively can help improve individual cognition and work efficiency and prevent problems before they occur. Early emotion analysis for children has also gradually become an important research direction in speech emotion recognition. Therefore...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L25/63, G10L25/03
Inventors: 梁瑞宇, 冯月芹, 唐闺臣, 王青云, 花涛, 包永强, 陈姝, 顾保府
Owner: NANJING INST OF TECH