Voice motion recognition method based on feature selection and optimization

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology for speech emotion recognition and feature selection, applied in speech analysis, instruments, etc., can solve the problems of no general standard, difficult selection, and high computational complexity, and achieve the effect of improving accuracy and strong separability

Inactive Publication Date: 2019-03-19

XIDIAN UNIV

View PDF6 Cites 22 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

For example, in the emotion recognition method of Mandarin speech based on PCA and SVM, its emotion feature selection algorithm is principal component analysis PCA, and multiple SVMs are used for emotion recognition of speech, because the PCA algorithm is not suitable for feature selection of samples with non-Gaussian distribution , so this support vector machine SVM speech emotion recognition method is only suitable for the case of small sample size, for the case of large sample size, the kernel function mapping dimension is very high, the calculation complexity is high, and the speech emotion recognition rate is low, and for The choice of kernel function for nonlinear problems is difficult because there is no general standard

As another example, the convolutional neural network CNN speech emotion recognition method that has appeared in recent years, because it only uses spectral features such as Mel-frequency cepstral coefficients MFCC and spectrograms, without adding voice quality features and prosody features, it still cannot be obtained. More accurate speech emotion recognition rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0030] The present invention will be described in detail below with reference to the drawings and specific embodiments.

[0031] Refer to figure 1 , The voice emotion recognition method based on feature selection and optimization of the present invention includes two parts: extracting emotional features of voice and recognizing emotional voice. The steps are as follows:

[0032] 1. Extraction of voice emotional features

[0033] Step 1. Build a voice database to get the original voice.

[0034] This example selects the four emotional voices of happiness, anger, fear, and sadness from the casia Chinese emotional corpus as the original voices to form a voice database; the database has 800 voices, 200 each of the four emotional voices of anger, fear, happiness and sadness Sentences, these four emotional voices correspond to four category tags, there are four professional speakers, two males and two females, the voice format is wav format, the sampling period is 16kHz, the sampling accura...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice emotion recognition method based on feature selection and optimization, and mainly solves the problem of low accuracy rate of speech emotion recognition in the prior art. According to the scheme, the method includes the following steps of: 1) establishing an emotion voice database and obtaining original voice; 2) carrying out preemphasis, windowing by frames and end-point detection and other preprocessing on the original voice; 3) extracting emotion characteristic parameters of the pre-treated voice; 4) selecting the optimal parameters of the voice emotion by using a random forest algorithm; 5) inputting the optimal voice emotion characteristic parameters into a trained convolution neural network to obtain the result of the voice emotion recognition. By analyzing the importance of the voice emotion characteristics, the optimal voice emotion characteristic parameters are obtained, the accuracy of the convolution neural network algorithm to voice emotion recognition is improved, and the method can be applied to mobile phone communication, human-computer interaction and recognition of speaker emotion in medical diagnosis and criminal investigation.

Description

Technical field [0001] The invention belongs to the technical field of voice signal processing, and particularly relates to a voice emotion recognition method, which can be used for mobile phone communication, human-computer interaction, medical diagnosis and criminal investigation. Background technique [0002] The earliest research related to speech emotion recognition appeared in the mid-1980s. This is the first time that humans use acoustic features to study and analyze emotional features. In 1999, Moriyama proposed that there is a linear correlation model between speech and emotion, and realized the initial application of speech emotion in e-commerce. Since the beginning of the 21st century, the research of speech emotion recognition has been developed in various fields and gradually applied to the fields of human-computer interaction, mobile phone communication and clinical medicine, which has attracted worldwide attention. [0003] Speech emotion recognition is the process ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L25/63G10L25/24G10L25/30G10L25/45

CPCG10L25/24G10L25/30G10L25/45G10L25/63

Inventor 陈建春李欢欢王金鹏吴琴乜亮

Owner XIDIAN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice motion recognition method based on feature selection and optimization

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology