Unlock instant, AI-driven research and patent intelligence for your innovation.

A Speech Emotion Recognition Method Fused with Long Span Emotion History

An emotion and span technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of not being able to fully reflect the key role of emotional historical information, not being able to model contextual information, and classification performance not as good as discriminative classifiers, etc., to achieve real-time performance Good, high recognition accuracy, simple operation

Active Publication Date: 2016-07-27
中科极限元(杭州)智能科技股份有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since HMM is a generative classification model, its classification performance is not as good as that of discriminative classifiers.
At the same time, it cannot model long-span contextual information, that is, the fusion range of emotional history is limited, and it cannot fully reflect the key role of emotional history information in emotion recognition.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Speech Emotion Recognition Method Fused with Long Span Emotion History
  • A Speech Emotion Recognition Method Fused with Long Span Emotion History

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0011] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0012] It should be noted that, in the drawings or descriptions of the specification, similar or identical parts all use the same figure numbers. Implementations not shown or described in the accompanying drawings are forms known to those of ordinary skill in the art. It should be pointed out that the described examples are only considered for the purpose of illustration and not limitation of the present invention.

[0013] figure 1 It is a flow chart of a speech emotion recognition method that fuses long-span emotion history information proposed by the present invention, as figure 1 As shown, the speech emotion recognition method of the described fusion long-span emotion history comprises the following steps:

[0014]...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech emotion recognition method which integrates long-span emotion history. The method comprises the following steps: using different parameters in the time domain and transform domain to perform endpoint detection, eliminating non-speech data in the original speech sequence, and obtaining speech segment data to be recognized; dividing the speech segment data to be recognized into independent speech segments Data unit; use the first support vector machine to perform preliminary emotional state classification on the speech segment data unit respectively; add window to the preliminary classification result of the emotional state, and use the second support vector machine to perform fusion to obtain emotional recognition with long-span emotional history result. While ensuring high-precision classification of local units of speech signals, the present invention makes full use of context information in long spans of signal sequences to achieve optimal classification results for each unit in the sequence. The invention can be used for emotion recognition of speech signals, and has the advantages of good real-time performance, greatly improved recognition accuracy and the like.

Description

technical field [0001] The invention belongs to the field of speech signal processing, and in particular relates to a speech emotion recognition method which integrates long-span emotion history, and thereby improves the accuracy of continuous speech emotion recognition. Background technique [0002] For decades, researchers at home and abroad have done a lot of research work on speech emotion recognition and proposed many effective algorithms for emotion recognition. These methods can be divided into detection methods based on static classifiers and detection methods based on dynamic classifiers in terms of processing strategies. Detection methods based on static classifiers mostly use support vector machines (SVM), neural networks, Boosting, etc., and most of these classifiers are discriminative models. Due to its strong discrimination ability, it is widely used in the field of emotional state recognition, but this method ignores the interconnection between the emotional ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/63G10L15/08G10L15/06
Inventor 陶建华杨明浩巢林林
Owner 中科极限元(杭州)智能科技股份有限公司