Voice motion recognition method based on feature selection and optimization

A technology for speech emotion recognition and feature selection, applied in speech analysis, instruments, etc., can solve the problems of no general standard, difficult selection, and high computational complexity, and achieve the effect of improving accuracy and strong separability

Inactive Publication Date: 2019-03-19
XIDIAN UNIV
View PDF6 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, in the emotion recognition method of Mandarin speech based on PCA and SVM, its emotion feature selection algorithm is principal component analysis PCA, and multiple SVMs are used for emotion recognition of speech, because the PCA algorithm is not suitable for feature selection of samples with non-Gaussian distribution , so this support vector machine SVM speech emotion recognition method is only suitable for the case of small sample size, for the case of large sample size, the kernel function mapping dimension is very high, the calcu

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice motion recognition method based on feature selection and optimization
  • Voice motion recognition method based on feature selection and optimization
  • Voice motion recognition method based on feature selection and optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The present invention will be described in detail below with reference to the drawings and specific embodiments.

[0031] Refer to figure 1 , The voice emotion recognition method based on feature selection and optimization of the present invention includes two parts: extracting emotional features of voice and recognizing emotional voice. The steps are as follows:

[0032] 1. Extraction of voice emotional features

[0033] Step 1. Build a voice database to get the original voice.

[0034] This example selects the four emotional voices of happiness, anger, fear, and sadness from the casia Chinese emotional corpus as the original voices to form a voice database; the database has 800 voices, 200 each of the four emotional voices of anger, fear, happiness and sadness Sentences, these four emotional voices correspond to four category tags, there are four professional speakers, two males and two females, the voice format is wav format, the sampling period is 16kHz, the sampling accura...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a voice emotion recognition method based on feature selection and optimization, and mainly solves the problem of low accuracy rate of speech emotion recognition in the prior art. According to the scheme, the method includes the following steps of: 1) establishing an emotion voice database and obtaining original voice; 2) carrying out preemphasis, windowing by frames and end-point detection and other preprocessing on the original voice; 3) extracting emotion characteristic parameters of the pre-treated voice; 4) selecting the optimal parameters of the voice emotion by using a random forest algorithm; 5) inputting the optimal voice emotion characteristic parameters into a trained convolution neural network to obtain the result of the voice emotion recognition. By analyzing the importance of the voice emotion characteristics, the optimal voice emotion characteristic parameters are obtained, the accuracy of the convolution neural network algorithm to voice emotion recognition is improved, and the method can be applied to mobile phone communication, human-computer interaction and recognition of speaker emotion in medical diagnosis and criminal investigation.

Description

Technical field [0001] The invention belongs to the technical field of voice signal processing, and particularly relates to a voice emotion recognition method, which can be used for mobile phone communication, human-computer interaction, medical diagnosis and criminal investigation. Background technique [0002] The earliest research related to speech emotion recognition appeared in the mid-1980s. This is the first time that humans use acoustic features to study and analyze emotional features. In 1999, Moriyama proposed that there is a linear correlation model between speech and emotion, and realized the initial application of speech emotion in e-commerce. Since the beginning of the 21st century, the research of speech emotion recognition has been developed in various fields and gradually applied to the fields of human-computer interaction, mobile phone communication and clinical medicine, which has attracted worldwide attention. [0003] Speech emotion recognition is the process ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L25/63G10L25/24G10L25/30G10L25/45
CPCG10L25/24G10L25/30G10L25/45G10L25/63
Inventor 陈建春李欢欢王金鹏吴琴乜亮
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products