Speech spectrum characteristic extracting method facing speech emotion identification

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech emotion recognition and feature extraction technology, applied in speech analysis, instruments, etc., can solve problems such as research and lack of speech emotion recognition algorithms, and achieve the effects of reducing differences, improving distinctions, and reducing error rates

Inactive Publication Date: 2015-05-20

NANJING INST OF TECH

View PDF5 Cites 22 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Spectrum-based research mainly includes sound classification, sound recognition, sound enhancement, etc., but there is no algorithm research on speech emotion recognition based on spectral features.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0037] The present invention will be further described below in conjunction with the accompanying drawings. The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0038] Such as figure 1 As shown, a spectral feature extraction method for speech emotion recognition includes the following steps:

[0039] In step 1, the speech signal is divided into frames, and fast Fourier transform is performed to obtain the corresponding spectrogram.

[0040] Step 2, decompose the spectrogram

[0041] The image is convolved with the linear decomposition Gaussian kernel, and different channels are decomposed on different scales to obtain a multi-channel and multi-scale decomposed image; the channels here include color channels, brightness channels and direction channels.

[0042] The relationship between decomposed images on different scales of the same channel is P(σ)=...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech spectrum characteristic extracting method facing speech emotion identification. The method comprises the first step of framing a speech signal and conducting the fast Fourier transformation to obtain a corresponding speech spectrum, the second step of resolving the speech spectrum, the third step of conducting the central peripheral subtract calculation on the resolved image and normalizing to obtain a characteristic image of each resolving image, the fourth step of extracting the characteristic matrix of each characteristic image and a fifth step of conducting dimensionality reduction on the characteristic matrix and reconstituting. The method comprehensively uses the image processing methods from the perspective of analyzing the speech spectrum characteristic, excavates the emotion identification characteristic from a creative perspective, uses a multiscale and multichannel filter to resolve the speech spectrum, conducts the processing in different characteristic fields, and combines the PCA analysis to better excavate beneficial information to speech emotion.

Description

technical field [0001] The invention relates to a speech spectrum feature extraction method for speech emotion recognition, which belongs to the technical field of speech emotion recognition. Background technique [0002] With the development of human-computer interaction technology, speech emotion recognition has become one of the key technologies. In order to make the human-computer interaction system and the dialogue system of the robot more intelligent and perfect, the emotion analysis of speech becomes more and more important. In addition, in some long-term, monotonous, and high-intensity tasks (such as spaceflight, navigation, etc.), relevant personnel often have certain negative emotions. Effective identification of these negative emotions can help improve individual cognition and work efficiency, and prevent problems before they happen. Early emotion analysis for children has gradually become an important research direction of speech emotion recognition. Therefore...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L25/63G10L25/03

Inventor梁瑞宇冯月芹唐闺臣王青云花涛包永强陈姝顾保府

OwnerNANJING INST OF TECH

Speech spectrum characteristic extracting method facing speech emotion identification

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology