A speech signal feature learning method based on the first derivative of Mel spectrum

A speech signal, first derivative technology, applied in speech analysis, instruments, etc., to achieve the effect of high speed and scalability, good discrimination, and less training time

Active Publication Date: 2021-06-08
浙江中点人工智能科技有限公司
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But this kind of diagnosis depends on the doctor's personal senses and the valuable experience accumulated in the long-term practice of medicine, and this experience cannot be copied

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A speech signal feature learning method based on the first derivative of Mel spectrum
  • A speech signal feature learning method based on the first derivative of Mel spectrum

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The application will be described in further detail below in conjunction with the accompanying drawings. It is necessary to point out that the following specific embodiments are only used to further illustrate the application, and cannot be interpreted as limiting the protection scope of the application. The above application content makes some non-essential improvements and adjustments to this application.

[0027] combine figure 1 , figure 2 As shown, the speech signal feature learning method based on the mel spectrum first derivative of the present invention comprises the steps:

[0028] Step 1. Input disease speech samples and healthy speech samples;

[0029] Step 2. Framing all samples, detecting speech endpoints, extracting the first derivative of Mel spectrum MFCC with respect to time DMS (first Derivative of Mel-Spectrogram), and using matrix A for each sample i express;

[0030] The analysis of MFCC is based on the auditory principle of the human ear, whic...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention proposes a speech signal feature learning method based on the first-order derivative of the Mel spectrum. The method includes, on the basis of data drive, inputting disease speech samples and healthy speech samples, minute-handling all samples, and extracting the Mel spectrum. For the first derivative of time, use the cross-validation method to divide the training set and test set for disease samples and healthy samples respectively, and use the clustering algorithm to train dictionaries for healthy voices and pathological voices respectively. The DMS of each sample is linearly encoded and pooled using the minimum pooling method to obtain the final features. The supervised method makes full use of label information, and the learned features have better discriminative power.

Description

technical field [0001] The invention relates to the field of artificial intelligence speech recognition, in particular to a speech signal feature learning method based on the first derivative of Mel spectrum. Background technique [0002] The method of diagnosing diseases by sound has received widespread attention in recent years because of its advantages of simplicity, convenience, speed, and no need to damage the patient's body and invasive examination. Studies have shown that speech signals contain rich biomedical information. For example, speech can become very soft, and eventually develop into a monotonous, non-fluctuating voice, and it can be judged that an individual may suffer from Parkinson's disease. When an individual has thyroid disease, it can lead to hormonal imbalances that can even lead to paralysis or paralysis of the vocal cords, which can make the voice muffled and sometimes even whisper-like. By extracting and analyzing the biological information feature...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/18G10L25/27G10L25/48G10L25/66G10L17/04
CPCG10L17/04G10L25/18G10L25/27G10L25/48G10L25/66
Inventor 朱成华卢光明武克斌张大鹏钟德才
Owner 浙江中点人工智能科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products