Audio frequency classification method

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A classification method and audio technology, applied in the field of information processing, can solve problems such as difficult hardware implementation, increased calculation amount, and large amount of calculation, so as to avoid misjudgment, reduce calculation amount, and improve accuracy

Inactive Publication Date: 2008-03-19

HUAWEI TECH CO LTD

View PDF0 Cites 31 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] In prior art 1, each step needs to judge the category of audio according to one or several audio features and their thresholds. Therefore, this prior art requires a relatively large amount of computation when extracting feature parameters with better performance. For example, extracting MFCC The parameters need to perform Mel filtering, discrete cosine transform (DCT, Discrete CosineTransform), etc., so the amount of calculation is increased, and the existing technology is also affected by the judgment order of multiple characteristic parameters

In addition, in prior art 2, the classifier needs to be trained with a large amount of data in advance, the whole process requires a large amount of calculation, and it is not easy to realize by hardware

Therefore, the defect of the prior art is that the amount of calculation is relatively large in the process of audio signal classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0040] The bandwidth of the voice signal is between 0.3 Hz and 3.4 kHz, while the bandwidth of the music signal is generally around 22 kHz; the frequency center of the voice signal is lower than that of the music signal, and the energy of the voice signal is mainly concentrated in the low frequency band, while The frequency domain energy distribution of the music signal is relatively uniform, so the spectrum smoothing (SF) parameter of the speech signal is obviously larger than the SF parameter of the music signal.

[0041] According to above-mentioned theory and the defective of prior art, proposed a conception of judging signal type with spectrum smoothing parameter, the process of utilizing SF parameter to judge signal type is as follows: at first, calculate the Fast Fourier Transform (FFT, Fast Fourier Transform) of audio signal Get the spectrum amplitude; secondly, calculate the absolute value of the difference between the amplitude values of two adjacent points; then, c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention discloses an audio classifying method, which comprises preprocessing an input audio signal and then calculating the linear predictive coding coefficient of the processed audio signal; getting the spectral envelope of the signal according to the linear predictive coding coefficient and then determining the amplitude difference value of the coefficient by an index spectrum obtained by calculation; setting a threshold according to the statistical results of the amplitude difference values and then classifying the audio signal according to the threshold. The present invention can significantly reduce calculation amount brought by the classification of audio signals and, at the same time, have high accuracy in audio signal classification. In addition, when being applied to signal processing flow in extended bandwidth self-adaptive multi-rate coding standards, the present invention can reduce the calculation amount of audio signal classification to extremely low and, in addition, can ensure that the signal processing flow codes directly using corresponding coding modes without the need of pre-coding procedures, thereby improving the coding efficiency.

Description

technical field [0001] The invention relates to the field of information processing, in particular to an audio classification method. Background technique [0002] In the Extended Adaptive Multi-Ratc-Wideband (AMR-WB+, Extended Adaptive Multi-Ratc-Wideband) coding standard, there are two core coding modes, Algebraic Code Excited Linear Prediction (ACELP, Algebraic Code Excited Linear Prediction) and Transmission Transform Coding Excitation (TCX, Transform Coded Excitation) mode, ACELP mode is more suitable for voice signals, and TCX mode is better for encoding music signals. In the AMR-WB+ standard, it is necessary to pre-encode each frame of signal, and then choose which best mode to use for encoding, but each frame of signal must be pre-encoded, which will lead to a very large amount of calculation, so it is necessary to Signals are pre-classified to reduce computation. Speech and music are the two most important types of data in audio signals, so distinguishing speech a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L19/14G10L19/12G10L19/00

Inventor 郭利斌马付伟

Owner HUAWEI TECH CO LTD

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Audio frequency classification method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology