Speech emotion recognition method and system based on semi-supervised adversarial variation self-coding

A speech emotion recognition and semi-supervised technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as the quality of emotional feature representation needs to be improved, performance input data disturbance, weak generalization ability, etc., to improve accuracy and generalization ability, the ability to improve feature distribution, and the effect of improving feature representation quality
CN112863494AActive Publication Date: 2021-05-28HUNAN UNIV

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
HUNAN UNIV
Publication Date
2021-05-28

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention discloses a speech emotion recognition method and system based on semi-supervised adversarial variation self-encoding, and the method comprises the steps: S1, constructing a generative adversarial network, and constructing a speech emotion recognition model through the combination of a semi-supervised variation self-encoding model and the generative adversarial network, wherein data with emotion labels in input data and corresponding emotion labels are used as input, data without emotion labels in the input data are used as emotion label attribute missing types for processing, feature probability distribution of the input data in a hidden layer is learned through the generative adversarial network, and an SSAVAE model is constructed; S2, training the constructed SSAVAE model by using a training set; and S3, inputting to-be-processed speech emotion data, and inputting the to-be-processed speech emotion data into the trained SSAVAE model to obtain an emotion recognition result. The method has the advantages of being simple in implementation method, high in recognition precision, good in generalization ability and data disturbance resistance and the like.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of speech emotion recognition, in particular to a method and system for speech emotion recognition based on semi-supervised confrontational variational self-encoding. Background technique

[0002] Speech emotion recognition aims to extract emotion-related features from speech signals, identify the emotional state of the current speaker, and enhance the naturalness of human-computer interaction. It can be widely used in human-computer interaction, voice customer service, vehicle-mounted systems, etc. Scenes. Speech emotion recognition is one of the tasks belonging to pattern recognition. Different supervised learning models can be used to construct speech emotion recognition systems with good recognition performance, such as hidden Markov models, Gaussian mixture models, support vector machines, etc. However, the above models are all shallow model structures, which limit model learning Deep Emotional Featur...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More