Multi-modal continuous emotion recognition method, service reasoning method and system
An emotion recognition and multi-modal technology, applied in the field of service robots, can solve problems such as poor robustness, scarce data sets, and low recognition accuracy, and achieve the effect of improving satisfaction, accuracy, and accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0055]This embodiment discloses a multimodal continuous emotion recognition method based on expression and voice, such as figure 1 shown, including the following steps:
[0056] Step 1: Obtain video data including user facial expressions and voice;
[0057] This embodiment chooses to conduct experimental verification on the AVEC2013 data set. The AVEC2013 database is a public data set provided by the third audio-visual emotion challenge competition, which not only contains facial expression and voice emotion data, but also has such figure 2 Sentiment labels for the two continuous dimensions of Arousal and Valence shown.
[0058] Step 2: Based on the pre-trained face recognition model, extract face images for emotion recognition; specifically include:
[0059] Step 2.1: Use the convolutional neural network based on the cascaded architecture to realize face detection and discard abnormal frames in the expression video frame, and extract the face image;
[0060] First, by co...
Embodiment 2
[0107] The purpose of this embodiment is to provide a multimodal continuous emotion recognition system based on expression and voice, including:
[0108] A data acquisition module configured to acquire video data including user facial expressions and voice;
[0109] The expression and emotion recognition module is configured to extract face images from the video image sequence, perform feature extraction on the face images, and obtain expression and emotion features; perform continuous emotion recognition based on the pre-trained deep learning model according to the expression and emotion features;
[0110] The speech emotion recognition module is configured to use Mel-frequency cepstral coefficients to obtain speech emotion features for the speech data; perform continuous emotion recognition based on the pre-trained transfer learning network according to the speech emotion features;
[0111] The data fusion module is configured to fuse expression emotion recognition results a...
Embodiment 3
[0116] The purpose of this embodiment is to provide a computer-readable storage medium.
[0117] A computer-readable storage medium stores a computer program thereon, and when the program is executed by a processor, the method described in Embodiment 1 is implemented.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com