A speech emotion recognition method and device based on domain confrontation

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech emotion recognition and field technology, applied in speech analysis, neural learning methods, character and pattern recognition, etc., to achieve the effect of good classification and high recognition accuracy

Active Publication Date: 2022-03-08

SOUTHEAST UNIV

View PDF5 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Most of the existing methods process speech signals from two perspectives: the frame scale and the entire sentence scale, and few methods consider combining the above two scales

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0033] This embodiment provides a speech emotion recognition method based on domain confrontation, such as figure 1 and figure 2 shown, including:

[0034] (1) Obtain a speech emotion database storing several speech signals and corresponding emotion category labels, and divide it into a source domain database and a target domain database.

[0035] Among them, the method of dividing the source domain database and the target domain database is the Leave-One-Subject-Out Cross Validation method: the voice signal belonging to any person in the voice emotion database and the corresponding emotion category label are used as the target domain database , and the speech signals and corresponding emotion category labels of all others are used as the source domain database.

[0036] (2) For each speech signal in the source domain database and the target domain database, extract its IS10 feature as the global feature of the corresponding speech signal.

[0037] Among them, the IS10 fea...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice emotion recognition method and device based on domain confrontation. The method includes: (1) acquiring a voice emotion database and dividing it into a source domain database and a target domain database; (2) extracting IS10 features for each voice signal As a global feature; (3) divide the speech signal into several short segments overlapping 50% before and after according to time, and extract the IS10 feature of each short segment; (4) input the IS10 features of all short segments into the two-way long-short-term memory model, and then input Enter the attention mechanism model, and output as local features; (5) connect global features and local features in series as joint features; (6) build neural networks, including domain discriminators and sentiment classifiers; (7) train neural networks, The total loss of the network is the loss of the emotion classifier minus the loss of the domain discriminator; (8) Obtain the joint features of the speech signal to be recognized, input the trained neural network, and obtain the predicted emotion category. The identification result of the present invention is more accurate.

Description

technical field [0001] The invention relates to speech emotion recognition technology, in particular to a speech emotion recognition method and device based on domain confrontation. Background technique [0002] Speech emotion recognition is a hot research problem in the field of affective computing, with broad application prospects. Since speech signals have unique sequence properties, speech emotion recognition can be viewed as a dynamic or static classification problem. Most existing methods process speech signals from two perspectives: the frame scale and the entire sentence scale, and few methods consider combining the above two scales. The difficulty of speech emotion recognition lies in extracting appropriate speech emotion features and reducing the feature distribution difference between source domain database (training database) data and target domain database (test database) data. Contents of the invention [0003] Purpose of the invention: The present inventio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L25/63G10L25/30G06K9/62G06N3/08

CPCG10L25/63G10L25/30G06N3/08G06F18/24G06F18/214

Inventor郑文明郑婉璐宗源路成

OwnerSOUTHEAST UNIV

A speech emotion recognition method and device based on domain confrontation

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology