The invention discloses an emotion recognition method, system, device, and medium based on multi-modal feature fusion. The method comprises the steps of: obtaining preset first voice information and corresponding first visual information, and performing feature extraction on the first voice information and the first visual information to obtain a voice feature image and an expression feature image; performing feature fusion on the voice feature image and the expression feature image to obtain a first multi-modal feature, and constructing a training data set according to the first multi-modal feature; inputting the training data set into a pre-constructed convolutional neural network for training to obtain a trained multi-modal
feature recognition model; and identifying the emotion of a person to be tested according to the multi-modal feature recognition model. On the one hand, model complexity is reduced and the efficiency of model training and emotion recognition is improved; on the other hand, the influence of both the voice features and the expression features on the model's emotion recognition result is taken into account, which improves emotion recognition accuracy. The method can be widely applied in the technical field of emotion recognition.
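
As an illustrative, non-limiting sketch, the feature fusion and training data set construction steps might be expressed as follows. The abstract does not state how the fusion is performed, so stacking the two feature images as channels is an assumption made here, and the names `fuse_features` and `build_training_set`, the integer emotion labels, and the toy 2x2 values are all hypothetical.

```python
from typing import List, Tuple

Image = List[List[float]]   # one 2-D feature image (rows x cols)
Fused = List[Image]         # fused feature (channels x rows x cols)


def fuse_features(voice_img: Image, expr_img: Image) -> Fused:
    """Fuse two same-sized feature images by stacking them as channels.

    Assumption: channel stacking; the source does not specify the fusion rule.
    """
    same_rows = len(voice_img) == len(expr_img)
    same_cols = all(len(v) == len(e) for v, e in zip(voice_img, expr_img))
    if not (same_rows and same_cols):
        raise ValueError("feature images must have identical shapes")
    # Copy each row so the fused sample does not alias the input images.
    return [[row[:] for row in voice_img], [row[:] for row in expr_img]]


def build_training_set(
    pairs: List[Tuple[Image, Image]], labels: List[int]
) -> List[Tuple[Fused, int]]:
    """Pair each fused multi-modal feature with its (hypothetical) emotion label."""
    if len(pairs) != len(labels):
        raise ValueError("need exactly one label per (voice, expression) pair")
    return [(fuse_features(v, e), y) for (v, e), y in zip(pairs, labels)]


# Toy 2x2 feature images (hypothetical values, for illustration only).
voice = [[0.1, 0.2], [0.3, 0.4]]
expression = [[0.9, 0.8], [0.7, 0.6]]
dataset = build_training_set([(voice, expression)], labels=[1])
fused, label = dataset[0]
```

Under this assumption, each training sample is a two-channel image, which is a layout that a convolutional neural network could accept directly as input; other fusion rules (for example, side-by-side concatenation) would fit the same interface.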