Kernel fuzzy c-means speech emotion identification method combined with secondary identification of support vector machine

A speech emotion recognition and support vector machine technology, applied in the field of emotion recognition, can solve the problem that the predicted label is not optimal

Inactive Publication Date: 2012-09-12
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF0 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, the mixed attribute of emotion should be considered when designing a speech emotion recognition system, so the previous recognition method is not optimal only to give the unknown emotion a separate prediction label (see literature: A Framework for Automatic Human Emotion Classification Using Emotion Profiles, Emily M., Narayanan, S., IEEE Trans On Audio, Speech and Language Processing, 2011, vol.19, pp.1507-1520)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Kernel fuzzy c-means speech emotion identification method combined with secondary identification of support vector machine
  • Kernel fuzzy c-means speech emotion identification method combined with secondary identification of support vector machine
  • Kernel fuzzy c-means speech emotion identification method combined with secondary identification of support vector machine

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] Firstly, 30 sentences are selected from the four emotion corpora of happiness, anger, neutral and sadness in the German speech emotion database respectively as training corpus, and the rest as test corpus. The selected corpus is pre-emphasized, framed, and windowed. The frame length is 30ms, and the frame is shifted by 15ms. After the above processing, the 12-dimensional Mel cepstrum coefficient is used as the characteristic parameterized speech, and each sentence is only taken 80 frames of short speech features of about 1.4s, and each frame removes the first Mel cepstrum coefficient, so that each emotional training feature parameter and test feature parameter are obtained; in addition, 16-dimensional linear prediction coefficients are used as feature parameterization For speech, only 80-frame-length features are used as the training and testing feature parameters for secondary recognition. After obtaining the characteristic parameters, proceed as follows:

[0033] Step 1...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a kernel fuzzy c-means speech emotion identification method combined with the secondary identification of a support vector machine. The kernel fuzzy c-means speech emotion identification method comprises the following steps of: firstly, using a Mel-frequency cepstral coefficient to map to a high dimension characteristic space by a nonlinear kernel function; obtaining a clustering center by utilizing the kernel fuzzy c-means and using the clustering center as a code book quantized by a vector; then estimating a sample category label by an average fuzzy weighted minimum vector quantized error rule; and after estimating the label, carrying out secondary identification on pleasure and anger by utilizing a support vector machine method and a linear prediction coefficient. The kernel fuzzy c-means speech emotion identification method is an important composition part for automatically identifying the emotion in a man-machine interactive system, is a first step of carrying out emotional interaction between the machine and a person and has important application prospect for a system which uses people as the design center.

Description

Technical field [0001] The invention belongs to the cross field of speech processing and artificial intelligence, and particularly relates to an emotion recognition method in an intelligent human-computer interaction system. Background technique [0002] At present, speech emotion pattern recognition methods based on traditional statistical methods, or machine learning such as neural networks, these traditional classic methods can only achieve the ideal recognition rate when the sample data is large enough, so when the actual sample data When less or limited, the engineering application of traditional methods will be limited. In addition, it is well known that the expressions of natural emotions and the presented states are often more ambiguous or mixed (see literature: Emotion classification from speech using evaluator reliability-weighted combination of ranked-lists, Audhkhasi, K., Narayanan, S. , In ICASS, Issue: July, 2011, pp: 4956-4959.). Therefore, when designing a speec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
Inventor 黄杰王良翼徐斌何文洲
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products