Voice segment detection method based on MFCC similarity of EMD-Wavelet

The detection method, applied in speech analysis, instruments, and similar fields, addresses the weak performance of MFCC-based speech segment detection at low signal-to-noise ratio, and achieves good robustness and adaptability, suppression of noise influence, and high detection accuracy.

Active Publication Date: 2019-03-01

SOUTHEAST UNIV

9 Cites, 12 Cited by


Problems solved by technology

Although MFCC reflects the characteristics of speech signals well, its effectiveness for speech segment detection is limited at low signal-to-noise ratios.


Embodiment Construction

[0050] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0051] See Figure 1, which is a flowchart of the present invention. The steps of the present invention are described in detail below with reference to the flowchart.

[0052] A speech segment detection method based on the MFCC similarity of EMD-Wavelet, comprising the following steps:

[0053] (1) Record the voice signal of a person speaking, and use the collected voice signal as the source signal;

[0054] (2) Decompose the noisy speech signal by EMD to obtain the intrinsic mode functions (IMFs) of each order, which reflect the high- and low-frequency energy of the speech signal. The specific process is as follows:

[0055] (2.1) First find the local extremum points of the whole signal, and use cubic spline interpolation to form the upper envelope and the lower envelope of the signal.

[0056] (2.2) Calcul...
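
Step (2.1) can be illustrated with a minimal Python sketch that uses only the standard library. It is not the patent's implementation: the function names (`local_extrema`, `natural_cubic_spline`, `envelopes`) are hypothetical, the spline uses natural boundary conditions, and both signal endpoints are added as knots so the envelopes span the whole signal. The source does not specify any of these choices. Step (2.2) is truncated in the source and is not reconstructed here.

```python
import bisect
import math

def local_extrema(signal):
    """Return index lists (maxima, minima) of the interior local extrema."""
    maxima, minima = [], []
    for i in range(1, len(signal) - 1):
        if signal[i - 1] < signal[i] >= signal[i + 1]:
            maxima.append(i)
        elif signal[i - 1] > signal[i] <= signal[i + 1]:
            minima.append(i)
    return maxima, minima

def natural_cubic_spline(xs, ys):
    """Build a natural cubic spline through (xs, ys); xs strictly increasing."""
    n = len(xs) - 1
    h = [xs[i + 1] - xs[i] for i in range(n)]
    alpha = [0.0] * (n + 1)
    for i in range(1, n):
        alpha[i] = 3 * (ys[i + 1] - ys[i]) / h[i] - 3 * (ys[i] - ys[i - 1]) / h[i - 1]
    # Forward sweep of the tridiagonal system for the second-order coefficients.
    l, mu, z = [1.0] * (n + 1), [0.0] * (n + 1), [0.0] * (n + 1)
    for i in range(1, n):
        l[i] = 2 * (xs[i + 1] - xs[i - 1]) - h[i - 1] * mu[i - 1]
        mu[i] = h[i] / l[i]
        z[i] = (alpha[i] - h[i - 1] * z[i - 1]) / l[i]
    # Back substitution.
    b, c, d = [0.0] * n, [0.0] * (n + 1), [0.0] * n
    for j in range(n - 1, -1, -1):
        c[j] = z[j] - mu[j] * c[j + 1]
        b[j] = (ys[j + 1] - ys[j]) / h[j] - h[j] * (c[j + 1] + 2 * c[j]) / 3
        d[j] = (c[j + 1] - c[j]) / (3 * h[j])

    def evaluate(x):
        # Pick the spline piece whose interval contains x.
        j = min(max(bisect.bisect_right(xs, x) - 1, 0), n - 1)
        dx = x - xs[j]
        return ys[j] + b[j] * dx + c[j] * dx ** 2 + d[j] * dx ** 3

    return evaluate

def envelopes(signal):
    """Upper and lower envelopes through the local maxima and minima."""
    maxima, minima = local_extrema(signal)
    last = len(signal) - 1
    # Assumption: endpoints are used as extra knots for both envelopes.
    upper_idx = [0] + maxima + [last]
    lower_idx = [0] + minima + [last]
    upper_fn = natural_cubic_spline(upper_idx, [signal[i] for i in upper_idx])
    lower_fn = natural_cubic_spline(lower_idx, [signal[i] for i in lower_idx])
    samples = range(len(signal))
    return [upper_fn(t) for t in samples], [lower_fn(t) for t in samples]

# Example: a sine wave with a period of 20 samples.
signal = [math.sin(2 * math.pi * t / 20) for t in range(100)]
upper, lower = envelopes(signal)
```

Each envelope passes through the extrema it was built from, so `upper[5]` matches the first maximum of the sine and `lower[15]` matches its first minimum. Near the signal boundaries the envelopes depend heavily on the endpoint assumption.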


Abstract

The invention discloses a voice segment detection method based on the MFCC similarity of EMD-Wavelet. The method includes: collecting a speaker's voice signal as the source signal; decomposing the noisy voice signal by empirical mode decomposition (EMD) to obtain the intrinsic mode functions (IMFs) of each order; determining the order of the noise-dominated IMFs from the variance of the autocorrelation coefficients of each IMF component; applying wavelet threshold denoising to the noise-dominated IMFs; reconstructing the signal from the denoised low-order IMF components and the remaining high-order IMF components to obtain a denoised voice signal; calculating the Mel frequency cepstrum coefficients (MFCC) of the voice signal, with Euclidean distance as the measure of MFCC similarity; and distinguishing voice segments from noise segments on the similarity curve, thereby extracting the voice segments. Compared with traditional detection methods, the method has better robustness and adaptability and higher voice segment detection accuracy, and is well suited to voice segment extraction from voice signals.
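
Three of the operations named in the abstract can be sketched in plain Python. This is an illustration under stated assumptions, not the patented procedure. The function names are hypothetical. Soft thresholding is assumed because the abstract does not say which threshold function is used. The similarity reference is assumed to be the mean MFCC vector of the first few frames, treated as noise only. The EMD, the wavelet transform, and the MFCC computation itself are not shown.

```python
import math

def autocorrelation_variance(imf):
    """Variance of the normalized autocorrelation coefficients of one IMF.

    A noise-like component has an autocorrelation that is close to zero
    away from lag 0, so this variance is small; a periodic component
    gives a larger value.
    """
    n = len(imf)
    mean = sum(imf) / n
    centered = [v - mean for v in imf]
    energy = sum(v * v for v in centered)
    if energy == 0:
        return 0.0
    coeffs = [
        sum(centered[t] * centered[t + lag] for t in range(n - lag)) / energy
        for lag in range(n)
    ]
    mu = sum(coeffs) / n
    return sum((c - mu) ** 2 for c in coeffs) / n

def soft_threshold(coeffs, threshold):
    """Shrink wavelet coefficients toward zero (assumed soft thresholding)."""
    return [math.copysign(max(abs(c) - threshold, 0.0), c) for c in coeffs]

def euclidean_distance(a, b):
    """Euclidean distance between two MFCC vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def similarity_curve(mfcc_frames, noise_frames=3):
    """Distance of each frame's MFCC vector from a noise reference vector.

    Assumption: the first `noise_frames` frames contain noise only, and
    their mean MFCC vector serves as the reference.
    """
    dim = len(mfcc_frames[0])
    reference = [
        sum(frame[k] for frame in mfcc_frames[:noise_frames]) / noise_frames
        for k in range(dim)
    ]
    return [euclidean_distance(frame, reference) for frame in mfcc_frames]
```

With this convention, frames whose distance from the noise reference is large are candidates for voice segments. The decision threshold applied to the curve is not given in the source and is left open here.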

Description

Technical field

[0001] The invention relates to a method for detecting speech segments based on the MFCC similarity of EMD-Wavelet, and belongs to the field of speech segment detection in speech signal processing.

Background technique

[0002] Speech segment detection is an important part of speech signal analysis and processing. Its purpose is to extract the speech segments from a signal that contains speech. The detection accuracy directly affects the processing time and the amount of computation required for the speech signal. Improving the accuracy and efficiency of detecting the start and end points of speech segments has therefore long been a focus of research in speech recognition technology. Traditional speech segment detection algorithms often use methods such as short-term energy, short-term zero-crossing rate, and autocorrelation maximum. These methods achieve good detection results at high SNR. Down detection is poo...
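
For context, two of the traditional features named in the background, short-term energy and short-term zero-crossing rate, can be computed per frame as in the sketch below. The function names, frame size, and hop length are illustrative assumptions and are not taken from the patent.

```python
def split_frames(signal, size, hop):
    """Split a signal into overlapping frames of `size` samples."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, hop)]

def short_time_energy(frame):
    """Sum of squared samples in one frame."""
    return sum(v * v for v in frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

# Example: a short quiet stretch followed by a louder alternating stretch.
signal = [0.01, 0.02, 0.01, 0.02, 1.0, -1.0, 1.0, -1.0]
features = [
    (short_time_energy(f), zero_crossing_rate(f))
    for f in split_frames(signal, size=4, hop=4)
]
```

Both features are computed directly from the waveform, so additive noise raises the energy and the zero-crossing rate of non-speech frames as well.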


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G10L21/0208, G10L25/24, G10L25/51, G10L25/78

CPC: G10L21/0208, G10L25/24, G10L25/51, G10L25/78

Inventor: 贾民平, 花园, 胡建中, 许飞云, 黄鹏

Owner: SOUTHEAST UNIV
