Speech emotion recognition method and system based on semi-supervised adversarial variation self-coding

A speech emotion recognition and semi-supervised technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as the quality of emotional feature representation needs to be improved, performance input data disturbance, weak generalization ability, etc., to improve accuracy and generalization ability, the ability to improve feature distribution, and the effect of improving feature representation quality

Active Publication Date: 2021-05-28
HUNAN UNIV
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005](1) Since we usually only focus on learning the common representation of labeled data and unlabeled data in the feature space, the learned common representation is the low-dimensional feature of the input data Mapping, so the generalization ability of the above semi-supervised learning method is weak, and its performance is easily affected by the disturbance of the input data
[0006] (2) The quality of emotional feature representation will directly affect the recognition performance of the model. The above-mentioned model constructed using the semi-supervised learning method cannot fully represent emotional features. Representation quality still needs to be improved, affecting the accuracy of speech emotion recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech emotion recognition method and system based on semi-supervised adversarial variation self-coding
  • Speech emotion recognition method and system based on semi-supervised adversarial variation self-coding
  • Speech emotion recognition method and system based on semi-supervised adversarial variation self-coding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The present invention will be further described below in conjunction with the accompanying drawings and specific preferred embodiments, but the protection scope of the present invention is not limited thereby.

[0052] Such as figure 1 As shown, the steps of the speech emotion recognition method based on semi-supervised adversarial variational self-encoding in this embodiment include:

[0053] S1. SSAVAE model construction: build a generative confrontation network GAN and combine the semi-supervised variational autoencoder model SSVAE and the generative confrontation network to construct a speech emotion recognition model, in which the input data with emotional label data and the corresponding emotional label are used as input, Make the generated hidden layer features conform to the distribution characteristics of emotional labels, and treat the data without emotional labels in the input data as the missing type of emotional label attributes, that is, use the emotional ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech emotion recognition method and system based on semi-supervised adversarial variation self-encoding, and the method comprises the steps: S1, constructing a generative adversarial network, and constructing a speech emotion recognition model through the combination of a semi-supervised variation self-encoding model and the generative adversarial network, wherein data with emotion labels in input data and corresponding emotion labels are used as input, data without emotion labels in the input data are used as emotion label attribute missing types for processing, feature probability distribution of the input data in a hidden layer is learned through the generative adversarial network, and an SSAVAE model is constructed; S2, training the constructed SSAVAE model by using a training set; and S3, inputting to-be-processed speech emotion data, and inputting the to-be-processed speech emotion data into the trained SSAVAE model to obtain an emotion recognition result. The method has the advantages of being simple in implementation method, high in recognition precision, good in generalization ability and data disturbance resistance and the like.

Description

technical field [0001] The invention relates to the technical field of speech emotion recognition, in particular to a method and system for speech emotion recognition based on semi-supervised confrontational variational self-encoding. Background technique [0002] Speech emotion recognition aims to extract emotion-related features from speech signals, identify the emotional state of the current speaker, and enhance the naturalness of human-computer interaction. It can be widely used in human-computer interaction, voice customer service, vehicle-mounted systems, etc. Scenes. Speech emotion recognition is one of the tasks belonging to pattern recognition. Different supervised learning models can be used to construct speech emotion recognition systems with good recognition performance, such as hidden Markov models, Gaussian mixture models, support vector machines, etc. However, the above models are all shallow model structures, which limit model learning Deep Emotional Featur...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/06G10L15/26G10L25/27G10L25/63
CPCG10L15/063G10L15/26G10L25/27G10L25/63
Inventor 赵欢肖宇锋王松高迎雪
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products