Speech emotion recognition method and system based on ternary loss

A speech emotion recognition technology, applied in the field of emotion recognition, that addresses problems such as variable input length and unsatisfactory experimental results

Active Publication Date: 2018-12-14

INST OF AUTOMATION CHINESE ACAD OF SCI

5 Cites, 11 Cited by


AI Technical Summary

Problems solved by technology

[0003] In addition, speech emotion recognition faces a thorny problem: variable input length. Traditional machine learning methods require fixed-length input. The usual approach is to truncate longer samples and pad shorter samples with zeros. However, the experimental results of these practices are not ideal.
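The truncate-and-zero-pad baseline described in [0003] can be sketched as follows. This is only an illustration of the conventional practice the patent criticizes, not the patented method. The function name `pad_or_truncate` and the representation of a sample as a list of equal-width feature frames are assumptions made for the example.

```python
from typing import List


def pad_or_truncate(frames: List[List[float]], target_len: int) -> List[List[float]]:
    """Force a sequence of feature frames to exactly target_len frames."""
    if target_len < 0:
        raise ValueError("target_len must be non-negative")
    if len(frames) >= target_len:
        # Longer samples: keep only the first target_len frames.
        return [list(f) for f in frames[:target_len]]
    # Shorter samples: append all-zero frames of the same width.
    dim = len(frames[0]) if frames else 0
    padding = [[0.0] * dim for _ in range(target_len - len(frames))]
    return [list(f) for f in frames] + padding


short_sample = [[0.1, 0.2], [0.3, 0.4]]
long_sample = [[1.0, 1.0], [2.0, 2.0], [3.0, 3.0], [4.0, 4.0]]

print(pad_or_truncate(short_sample, 3))  # two real frames plus one zero frame
print(pad_or_truncate(long_sample, 3))   # last frame is discarded
```

Truncation discards information from long utterances, and zero frames carry no emotional content, which may be part of why [0003] reports unsatisfactory results for this practice.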

Embodiment Construction

[0090] Preferred embodiments of the present invention are described below with reference to the accompanying drawings. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the protection scope of the present invention.

[0091] A ternary loss-based emotion recognition method provided by the present invention will be described below with reference to the accompanying drawings.

[0092] Figure 1 shows, by way of example, the main steps of an emotion recognition method based on ternary loss in this embodiment. As shown in Figure 1, the method may comprise the following steps:

[0093] Step S101: Frame the speech data under test to obtain a speech sequence of a specific length.

[0094] Specifically, according to the preset time threshold, the speech data to be te...
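Because paragraph [0094] is truncated here, the exact way the preset time threshold is applied is not visible. The sketch below assumes one plausible reading: the threshold is a fixed segment duration, and the waveform is cut into consecutive segments of that duration. The names `split_into_segments`, `sample_rate`, and `segment_seconds` are hypothetical, and dropping the trailing remainder is likewise an assumption.

```python
from typing import List


def split_into_segments(
    samples: List[float], sample_rate: int, segment_seconds: float
) -> List[List[float]]:
    """Cut a waveform into consecutive, non-overlapping fixed-length segments."""
    segment_len = int(round(sample_rate * segment_seconds))
    if segment_len <= 0:
        raise ValueError("segment length must be at least one sample")
    # Assumption: a trailing remainder shorter than one segment is dropped.
    n_segments = len(samples) // segment_len
    return [
        samples[i * segment_len:(i + 1) * segment_len]
        for i in range(n_segments)
    ]


# Toy waveform: 10 samples at 4 Hz, cut into 1-second segments of 4 samples.
waveform = [float(i) for i in range(10)]
segments = split_into_segments(waveform, sample_rate=4, segment_seconds=1.0)
print(segments)
```

Every segment produced this way has the same length, which matches the stated goal of Step S101: a speech sequence of a specific length.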


Abstract

The invention belongs to the technical field of emotion recognition, and in particular relates to a speech emotion recognition method and system based on ternary loss, aimed at the technical problem of accurately recognizing easily confused emotion categories. To this end, the speech emotion recognition method comprises the following steps: framing the speech data under test to obtain a speech sequence of a specific length; performing temporal coding on the speech sequence with a preset emotional temporal coding network to obtain the emotion feature vectors corresponding to the speech sequence; and predicting the emotion category corresponding to the emotion feature vectors with a preset speech emotion classifier, according to multiple preset real emotion categories. The method recognizes easily confused speech emotion categories well, and it can be executed and implemented by the speech emotion recognition system.
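The abstract does not give the loss formula. Assuming that "ternary loss" corresponds to the widely used triplet loss, the idea can be sketched as below: an anchor feature vector should lie closer to a positive (same emotion) than to a negative (different emotion) by at least a margin. The function names, the Euclidean distance, and the margin value are assumptions for illustration and are not taken from the patent.

```python
import math
from typing import Sequence


def euclidean(u: Sequence[float], v: Sequence[float]) -> float:
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(
    anchor: Sequence[float],
    positive: Sequence[float],
    negative: Sequence[float],
    margin: float = 1.0,
) -> float:
    """Hinge-style triplet loss: max(0, d(a, p) - d(a, n) + margin)."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


anchor = [0.0, 0.0]
positive = [0.0, 1.0]        # same emotion category as the anchor
far_negative = [3.0, 4.0]    # different category, already well separated
near_negative = [0.0, 1.5]   # different category, easily confused

print(triplet_loss(anchor, positive, far_negative))   # 0.0
print(triplet_loss(anchor, positive, near_negative))  # 0.5
```

Only the easily confused triplet produces a non-zero loss, so training under this kind of objective concentrates on separating categories whose feature vectors lie close together, which is consistent with the problem the abstract targets.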

Description

Technical field

[0001] The invention belongs to the technical field of emotion recognition, and in particular relates to a speech emotion recognition method and system based on ternary loss.

Background

[0002] Speech emotion recognition has a wide range of applications in human-computer interaction and artificial intelligence, and is a key research direction in both fields. It mainly comprises two parts: speech emotion feature extraction and speech emotion recognition model training. Most speech emotion recognition methods focus on extracting robust and effective speech emotion features and on finding effective emotion recognition models. However, emotions are inherently ambiguous, and some emotions are particularly easy to confuse with each other, such as the two categories "angry" and "disgusted", and the two categories "surprise" and "sad".

[0003] In addition, there ...


Application Information


IPC(8): G10L25/63, G10L25/30

CPC: G10L25/30, G10L25/63

Inventor: 陶建华, 黄健, 李雅

Owner: INST OF AUTOMATION CHINESE ACAD OF SCI
