Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for extracting mfcc features

An extraction method and a technology of high-frequency components, which are applied in speech analysis, speech recognition, instruments, etc., can solve problems such as channel distortion, poor robustness of MFCC features, and incomplete matching of test data, so as to improve accuracy and robustness sexual effect

Active Publication Date: 2015-11-25
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, since the training data is pure audio data without any interference, and the test data is audio data collected in a natural environment, there may be obvious distortion due to some reasons, such as the noise of the surrounding environment, the transmission system introduced channel distortion, etc., so that the test data may not completely match the training data. Therefore, there may be a large difference between the MFCC features extracted from the test data and the MFCC features extracted from the training data using the existing technology, so that the MFCC features less robust

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for extracting mfcc features
  • Method and device for extracting mfcc features
  • Method and device for extracting mfcc features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0028] It should be noted that the terminals involved in the embodiments of the present invention may include but not limited to mobile phones, personal digital assistants (Personal Digital Assistant, PDA), wireless handheld devices, wireless netbooks, personal computers, portable computers, MP3 players, MP4 players Wait.

[0029] In addition, th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an extraction method and device for mel frequency cepstrum coefficient (MFCC) characteristics. The extraction method comprises the steps of utilizing a high-frequency section Mel filter contained in a Mel filter group to perform filtration treatment on preprocessed audio signals so as to generate Mel region high-frequency components, performing discrete cosine transform on the Mel region high-frequency components so as to generate conversion characteristics of every Mel region high-frequency component, and obtaining the MFCC characteristics of the audio signals according to the conversion characteristics of every Mel region high-frequency component. Due to the fact that the high-frequency section Mel filter contained in the Mel filter group is utilized to perform filtration treatment on the preprocessed audio signals, the Mel region high-frequency components can be obtained, Mel region low-frequency components subjected to environmental influence easily can be removed, the MFCC characteristics extracted from test data and MFCC characteristics extracted from training data do not have large difference, and accordingly robustness of the MFCC characteristics is improved.

Description

【Technical field】 [0001] The present invention relates to audio feature extraction technology, in particular to a method and device for extracting Mel Frequency Cepstrum Coefficient (MFCC) features. 【Background technique】 [0002] With the development of communication technology, the terminal integrates more and more functions, so that the system function list of the terminal includes more and more corresponding applications, for example, applications installed in the computer, third-party smart phones Installed applications (Application, APP), etc. Some applications involve the feature extraction of Mel Frequency Cepstrum Coefficient (MFCC) of some audio signals, for example, content-based music identification (Music Identification) service, similar music recommendation (Music Recommendation) service and other audio recognition services. In the prior art, a Mel filter bank is used to filter the preprocessed audio signal; then, a discrete cosine transform (DiscreteCosine Tr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/02G10L15/20G10L21/0232
Inventor 宋辉石立臣谢延
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD