Supercharge Your Innovation With Domain-Expert AI Agents!

Multi-modal depression detection method and system based on time convolutional neural network

A convolutional neural network and detection method technology, applied in the field of multimodal depression detection methods and systems, can solve problems such as limiting the performance of depression detection systems, low performance, and pauses.

Pending Publication Date: 2021-05-18
杭州医典智能科技有限公司
View PDF9 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the existing technology, detection methods based on biochemical reagents and EEG are usually used, while technical solutions based on voice, text or images are mostly based on voice data. During clinical interviews, patients may stutter and often Pauses between words, resulting in longer audio, video recordings than non-depressed patients
In short, the existing technology mainly has the following problems: In terms of the amount of training data, most of the existing multimodal depression detection systems based on speech, text or images are trained by limited depression data, so the performance is low ; In terms of feature extraction, the existing feature extraction methods lack the facial expression features of subjects when answering different questions, and their performance in the field of depression detection is insufficient, which limits the performance of the final depression detection system; in terms of depression classification modeling, The existing technology does not consider the long-term dependence of speech, text features and depression diagnosis; in terms of multi-modal fusion, the feature distribution between different modal data is very different. The output of the subsystems is connected in series, because the feature difference between the modalities leads to information loss in the process of feature fusion, so the performance is limited; in terms of model selection, the traditional method mostly uses the depression detection method based on the recurrent neural network, Limit the length of audio and video

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-modal depression detection method and system based on time convolutional neural network
  • Multi-modal depression detection method and system based on time convolutional neural network
  • Multi-modal depression detection method and system based on time convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The present invention will be further described in detail below in conjunction with specific embodiments. These embodiments are implemented on the premise of the technical solution of the present invention, and detailed implementation methods and specific operation processes are provided, but the protection scope of the present invention is not limited to the following examples.

[0053] Such as figure 1 Shown is a schematic diagram of a multimodal depression detection method based on temporal convolutional neural network, which specifically includes the following steps:

[0054] Step 1: construct a training sample set, the training sample set includes audio, 3D facial expressions and corresponding text information of patients with depression and non-depression;

[0055] Step 2: Carry out facial expression feature extraction to the 3D facial expression of described training sample set, obtain the 3D facial expression feature vector with situation awareness;

[0056] S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a multi-mode depression detection method and system based on a time convolutional neural network. The detection method specifically comprises the following steps: constructing a training sample set, wherein the training sample set comprises audios, 3D facial expressions and corresponding text information of depression and non-depression patients; performing 3D facial expression feature extraction on the training sample set to obtain a 3D facial expression feature vector with context awareness; the feature extraction module is used for performing acoustic feature extraction on the audio signals of the training sample set in combination with a Mel-frequency cepstrum coefficient to obtain voice vector features with context awareness; using a Transform model for processing word embedding of the training sample set, and obtaining text features with context awareness; fusing the 3D facial expression features, the voice vector features and the text features to obtain information for depression classification; and substituting the information for depression classification into the time convolutional neural network to obtain depression classification information. According to the invention, the accuracy of depression detection can be improved.

Description

technical field [0001] The invention belongs to the technical field of big data, and in particular relates to a multimodal depression detection method and system based on a temporal convolutional neural network. Background technique [0002] There are nearly 800,000 patients who commit suicide due to depression every year in the world. Compared with other physical diseases, mental disorders are more difficult to detect. In early clinical practice, doctors determined whether a patient was depressed by diagnosing the severity of depressive symptoms during a personal interview. Later, researchers quantitatively analyzed the time-domain characteristics of speech signals, such as pause time, recording time, feedback time to questions, speech speed, etc., to help doctors make auxiliary diagnoses for patients with depression. However, it was found that a single feature is less discriminative for auxiliary clinical diagnosis. In recent years, with the in-depth development of speec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V40/174G06V40/168G06N3/045G06F18/241G06F18/253G06F18/214
Inventor 杨忠丽李明定张光华武海荣
Owner 杭州医典智能科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More