A speech signal feature learning method based on the first derivative of Mel spectrum

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech signal, first derivative technology, applied in speech analysis, instruments, etc., to achieve the effect of high speed and scalability, good discrimination, and less training time

Active Publication Date: 2021-06-08

浙江中点人工智能科技有限公司

View PDF11 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

But this kind of diagnosis depends on the doctor's personal senses and the valuable experience accumulated in the long-term practice of medicine, and this experience cannot be copied

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0026] The application will be described in further detail below in conjunction with the accompanying drawings. It is necessary to point out that the following specific embodiments are only used to further illustrate the application, and cannot be interpreted as limiting the protection scope of the application. The above application content makes some non-essential improvements and adjustments to this application.

[0027] combine figure 1 , figure 2 As shown, the speech signal feature learning method based on the mel spectrum first derivative of the present invention comprises the steps:

[0028] Step 1. Input disease speech samples and healthy speech samples;

[0029] Step 2. Framing all samples, detecting speech endpoints, extracting the first derivative of Mel spectrum MFCC with respect to time DMS (first Derivative of Mel-Spectrogram), and using matrix A for each sample i express;

[0030] The analysis of MFCC is based on the auditory principle of the human ear, whic...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention proposes a speech signal feature learning method based on the first-order derivative of the Mel spectrum. The method includes, on the basis of data drive, inputting disease speech samples and healthy speech samples, minute-handling all samples, and extracting the Mel spectrum. For the first derivative of time, use the cross-validation method to divide the training set and test set for disease samples and healthy samples respectively, and use the clustering algorithm to train dictionaries for healthy voices and pathological voices respectively. The DMS of each sample is linearly encoded and pooled using the minimum pooling method to obtain the final features. The supervised method makes full use of label information, and the learned features have better discriminative power.

Description

technical field [0001] The invention relates to the field of artificial intelligence speech recognition, in particular to a speech signal feature learning method based on the first derivative of Mel spectrum. Background technique [0002] The method of diagnosing diseases by sound has received widespread attention in recent years because of its advantages of simplicity, convenience, speed, and no need to damage the patient's body and invasive examination. Studies have shown that speech signals contain rich biomedical information. For example, speech can become very soft, and eventually develop into a monotonous, non-fluctuating voice, and it can be judged that an individual may suffer from Parkinson's disease. When an individual has thyroid disease, it can lead to hormonal imbalances that can even lead to paralysis or paralysis of the vocal cords, which can make the voice muffled and sometimes even whisper-like. By extracting and analyzing the biological information feature...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L25/18G10L25/27G10L25/48G10L25/66G10L17/04

CPCG10L17/04G10L25/18G10L25/27G10L25/48G10L25/66

Inventor朱成华卢光明武克斌张大鹏钟德才

Owner浙江中点人工智能科技有限公司

A speech signal feature learning method based on the first derivative of Mel spectrum

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology