Speech depression state recognition method based on feature selection and transfer learning

A technology of transfer learning and feature selection, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of widening the distribution of speech signal features of the subjects, unsatisfied assumptions, and increasing the difficulty of model recognition, etc., to achieve low Model complexity, optimal recognition accuracy, and the effect of improving model efficiency

Pending Publication Date: 2021-10-26
FUDAN UNIV
View PDF16 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These factors will further increase the difference in the feature distribution of the speech signals of different subjects, increasing the difficulty of model recognition
[0005] In addition, in speech signal-related machine learning, when dividing the training set and test set, it is usually assumed that the data in the test set and the training set are independent and identically distributed. However, the feature distribution of the speech signal of the subject is not only affected by the level of depression , it will also be affected by other factors such as age, gender, occupation and other factors of the individual differences of the subjects, which will lead to the failure of this assumption and reduce the performance of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech depression state recognition method based on feature selection and transfer learning
  • Speech depression state recognition method based on feature selection and transfer learning
  • Speech depression state recognition method based on feature selection and transfer learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0020] figure 1 It is a flowchart of a speech suppression state recognition method based on feature selection and transfer learning according to an embodiment of the present invention.

[0021] like figure 1 As shown, the speech suppression state recognition method based on feature selection and transfer learning in the embodiment of the present invention includes the following steps:

[0022] Step S1, voice information collection, use recording equipment to collect voice, design questions of different types of speech tasks, the subjects answer according to the prompts on the screen, use the recording equipment to collect the complete speaking process of the subjects, and record it as a wav file, This file is the speech sample.

[0023] Step S2, voice signal preprocessing, preprocessing the collected voice samples, manual screening to exclude obvious noise segments, such as coughing, sounds of things falling, and high-pass filtering, down-sampling, silent segment detection a...

Embodiment 2

[0064] As mentioned above, Embodiment 1 provides a speech depression state recognition method based on feature selection and transfer learning, which mainly includes steps S1 to S6. In actual application, each step of the method in Embodiment 1 can be configured as a corresponding computer module, that is, a speech collection part, a preprocessing part, a feature extraction part, a feature processing part, a transfer learning part, and a classification part, and these parts form a A device for classifying and identifying speech depression states, thereby also providing a speech depression state recognition device based on feature selection and transfer learning.

[0065] figure 2 It is a schematic diagram of a speech depression state recognition device based on feature selection and transfer learning according to an embodiment of the present invention.

[0066] like figure 2 As shown, the speech depression state recognition device based on feature selection and transfer le...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a speech depression state recognition method based on feature selection and transfer learning, and provides the speech depression state recognition method fusing Lasso and a transfer learning method CORAL in order to solve the two problems that the feature dimension is higher, and the feature distribution is affected by the tested individual differences except for the depression level during modeling based on speech. The method has the advantages that the redundant information in Lasso filtering features is reserved, the effective features are reserved, and the recognition precision is further improved on the basis of improving the model efficiency; 2, on the premise that the depression label information is not leaked by the transfer learning method CORAL, the feature distribution of a training set and a test set is drawn close, and the influence of other factors except the depression level on the feature distribution is reduced; the combination of the two methods can further improve the accuracy and stability of depression screening.

Description

technical field [0001] The invention belongs to the field of speech signal processing, and in particular relates to a speech depression state recognition method based on feature selection and transfer learning. Background technique [0002] Depression is a typical and common mental illness worldwide, covering all ages. The current clinical diagnosis method of depression relies on the doctor's clinical experience and related scales filled in by the patient. The whole process takes a long time, and the diagnosis Process inefficiencies. Speech, as an important external expression of emotion, has become a key direction for researchers to realize automatic depression recognition methods because of its unique advantages such as less restrictions on use, low equipment cost, no contact, non-invasive and convenient acquisition methods. [0003] At present, there is no clear specific feature supported by theoretical background for depression recognition. The feature design level is t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L25/63
CPCG10L15/02G10L15/08G10L15/063G10L25/63
Inventor 赵张王守岩汪静莹刘伟
Owner FUDAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products