Speech emotion recognition method and system based on ternary loss

A speech emotion recognition technology, applied in the field of emotion recognition, that addresses problems such as variable-length input and unsatisfactory experimental results

Active Publication Date: 2018-12-14
INST OF AUTOMATION CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

[0003] In addition, there is a thorny problem in speech emotion recognition, which is the problem of variable input length. Traditional machine learning methods require fixed-



Examples


Embodiment Construction

[0090] Preferred embodiments of the present invention are described below with reference to the accompanying drawings. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the protection scope of the present invention.

[0091] The ternary-loss-based emotion recognition method provided by the present invention is described below with reference to the accompanying drawings.

[0092] Figure 1 shows an example of the main steps of the ternary-loss-based emotion recognition method in this embodiment. As shown in Figure 1, the method may comprise the following steps:

[0093] Step S101: Frame the speech data to be tested to obtain a speech sequence of a specific length.

[0094] Specifically, according to the preset time threshold, the speech data to be te...
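
The framing in step S101 can be illustrated with a minimal Python sketch. The function name `frame_speech`, its parameters, and the zero-padding of the final segment are assumptions made for illustration; the truncated paragraph above does not show how the patent handles a remainder shorter than the preset time threshold.

```python
def frame_speech(samples, sample_rate, max_seconds):
    """Split a 1-D sample sequence into segments of a fixed length.

    Hypothetical helper: `max_seconds` stands in for the preset time
    threshold. Zero-padding the last segment is an assumption, not
    something stated in the source text.
    """
    seg_len = int(sample_rate * max_seconds)
    if seg_len <= 0:
        raise ValueError("sample_rate * max_seconds must be at least 1 sample")

    segments = []
    for start in range(0, len(samples), seg_len):
        chunk = list(samples[start:start + seg_len])
        # Pad a short final chunk so every segment has the same length.
        chunk.extend([0.0] * (seg_len - len(chunk)))
        segments.append(chunk)
    return segments


if __name__ == "__main__":
    # 10 samples at 4 samples per second, threshold of 1 second.
    for segment in frame_speech([1.0] * 10, sample_rate=4, max_seconds=1):
        print(segment)
```

Every returned segment has the same length, which is what lets a downstream model accept recordings of arbitrary duration.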



Abstract

The invention belongs to the technical field of emotion recognition, relates in particular to a speech emotion recognition method and system based on ternary loss, and aims to solve the technical problem of accurately recognizing easily confused emotion categories. To this end, the speech emotion recognition method comprises the following steps: framing the speech data to be tested to obtain a speech sequence of a specific length; performing temporal coding on the speech sequence with a preset emotional temporal coding network to obtain the emotion feature vector corresponding to the speech sequence; and predicting the emotion category corresponding to the emotion feature vector with a preset speech emotion classifier, according to multiple preset real emotion categories. The method improves recognition of easily confused speech emotion categories, and it can be executed and implemented by the speech emotion recognition system.
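
As a rough illustration of the loss named in the title, the sketch below assumes that "ternary loss" refers to the standard triplet loss over emotion feature vectors: an anchor is pulled towards a sample of the same emotion (positive) and pushed away from a sample of an easily confused emotion (negative). The function names, the Euclidean distance, and the margin value are assumptions; this excerpt does not give the patent's exact formulation.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the negative is farther from the anchor
    than the positive by at least `margin` (an assumed hyperparameter).
    """
    d_pos = euclidean(anchor, positive)
    d_neg = euclidean(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


if __name__ == "__main__":
    anchor = [0.0, 0.0]
    same_emotion = [0.0, 1.0]
    confusable_emotion = [0.0, 1.5]
    print(triplet_loss(anchor, same_emotion, confusable_emotion))
```

Minimising this quantity over many triplets encourages feature vectors of confusable categories to separate by at least the margin, which is the usual motivation for pairing a triplet loss with a classifier.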

Description

Technical field

[0001] The invention belongs to the technical field of emotion recognition, and in particular relates to a speech emotion recognition method and system based on ternary loss.

Background

[0002] Speech emotion recognition has a wide range of applications in human-computer interaction and artificial intelligence, and is a key research direction in both fields. It mainly comprises two parts: speech emotion feature extraction and speech emotion recognition model training. Most speech emotion recognition methods focus on extracting robust and effective speech emotion features and on finding effective emotion recognition models. However, emotions are inherently ambiguous, and some emotions are particularly easy to confuse with each other, such as the two categories "angry" and "disgusted", and the two categories "surprise" and "sad".

[0003] In addition, there ...


Application Information

IPC(8): G10L25/63, G10L25/30
CPC: G10L25/30, G10L25/63
Inventor: 陶建华, 黄健, 李雅
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI