Audio frequency classification method

A classification method and audio technology, applied in the field of information processing, can solve problems such as difficult hardware implementation, increased calculation amount, and large amount of calculation, so as to avoid misjudgment, reduce calculation amount, and improve accuracy

Inactive Publication Date: 2008-03-19
HUAWEI TECH CO LTD
View PDF0 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In prior art 1, each step needs to judge the category of audio according to one or several audio features and their thresholds. Therefore, this prior art requires a relatively large amount of computation when extracting feature parameters with better performance. For example, extracting MFCC The parameters need to perform Mel filtering, discrete cosine transform (DCT, Discrete CosineTransform), etc., so the amount of calculation is increa

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio frequency classification method
  • Audio frequency classification method
  • Audio frequency classification method

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0040] The bandwidth of the voice signal is between 0.3 Hz and 3.4 kHz, while the bandwidth of the music signal is generally around 22 kHz; the frequency center of the voice signal is lower than that of the music signal, and the energy of the voice signal is mainly concentrated in the low frequency band, while The frequency domain energy distribution of the music signal is relatively uniform, so the spectrum smoothing (SF) parameter of the speech signal is obviously larger than the SF parameter of the music signal.

[0041] According to above-mentioned theory and the defective of prior art, proposed a conception of judging signal type with spectrum smoothing parameter, the process of utilizing SF parameter to judge signal type is as follows: at first, calculate the Fast Fourier Transform (FFT, Fast Fourier Transform) of audio signal Get the spectrum amplitude; secondly, calculate the absolute value of the difference between the amplitude values ​​of two adjacent points; then, c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention discloses an audio classifying method, which comprises preprocessing an input audio signal and then calculating the linear predictive coding coefficient of the processed audio signal; getting the spectral envelope of the signal according to the linear predictive coding coefficient and then determining the amplitude difference value of the coefficient by an index spectrum obtained by calculation; setting a threshold according to the statistical results of the amplitude difference values and then classifying the audio signal according to the threshold. The present invention can significantly reduce calculation amount brought by the classification of audio signals and, at the same time, have high accuracy in audio signal classification. In addition, when being applied to signal processing flow in extended bandwidth self-adaptive multi-rate coding standards, the present invention can reduce the calculation amount of audio signal classification to extremely low and, in addition, can ensure that the signal processing flow codes directly using corresponding coding modes without the need of pre-coding procedures, thereby improving the coding efficiency.

Description

technical field [0001] The invention relates to the field of information processing, in particular to an audio classification method. Background technique [0002] In the Extended Adaptive Multi-Ratc-Wideband (AMR-WB+, Extended Adaptive Multi-Ratc-Wideband) coding standard, there are two core coding modes, Algebraic Code Excited Linear Prediction (ACELP, Algebraic Code Excited Linear Prediction) and Transmission Transform Coding Excitation (TCX, Transform Coded Excitation) mode, ACELP mode is more suitable for voice signals, and TCX mode is better for encoding music signals. In the AMR-WB+ standard, it is necessary to pre-encode each frame of signal, and then choose which best mode to use for encoding, but each frame of signal must be pre-encoded, which will lead to a very large amount of calculation, so it is necessary to Signals are pre-classified to reduce computation. Speech and music are the two most important types of data in audio signals, so distinguishing speech a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L19/14G10L19/12G10L19/00
Inventor 郭利斌马付伟
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products