Audio classification and implementation method based on reclassification

A classification method and a classification device technology, which are applied in speech analysis, speech recognition, instruments, etc., can solve problems such as difficult hardware implementation, undisclosed implementation method of audio classification based on reclassification, large amount of calculation, etc., and achieve simple distinction , reduce computational complexity, and achieve excellent audio quality

Inactive Publication Date: 2010-06-23
数维科技(北京)有限公司
View PDF0 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The above classification methods have the following disadvantages: (1) they can only describe static statistical characteristics such as the mean and variance of the audio, while audio signal features usually have temporal statistical characteristics, for example, there are generally rhythms or drums that can reveal the theme in music; and In speech, unvoiced and voiced sounds tend to alternate, and these features are time-dependent
(2) Decision rules and search order are not necessarily optimal
The disadvantage of the above classification method is that the classifier needs to be trained with a large amount of data in advance, the whole process has a large amount of calculation, and it is not easy to implement in hardware
[0006] In addition, the prior art does not actually disclose the audio classification based on reclassification and its implementation method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio classification and implementation method based on reclassification
  • Audio classification and implementation method based on reclassification
  • Audio classification and implementation method based on reclassification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Preferred embodiments of the present invention will be described hereinafter by means of the accompanying drawings. In the following description, functions or constructions that have become prior art will not be described in detail since they would obscure the description of the present invention in unnecessary detail.

[0023] Such as figure 1 with 2 As shown, an embodiment of the present invention provides an audio classification method, which classifies audio signals before compressing and encoding the audio signals. Specifically, by judging the audio signal to be encoded as speech and music, the sensory audio encoder can be guided to adjust parameters adaptively according to the above classification results during audio encoding, so that the encoded audio quality can be better. The perceptual audio encoder can be any prior art audio encoder, such as the MPEG encoding specified in the Chinese national standard GB / T 17975.3-2002 "General Coding of Information Techno...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an audio classification method for classifying audio signals before audio coding; the audio classification method includes primary classification and reclassification, and is characterized in that the reclassification includes the primary classification result is smoothened. Besides, the invention also discloses an audio classification device arranged on the front end of an audio coder for classifying the audio signals; the audio classification device comprises a primary classifier and a reclassifier, and is characterized in that the reclassifier comprises a smoothing module for smoothing the primary classification result. The method and device of the invention are used to correctly distinguish music and voice from audio signals. As the reclassification includes the smoothening on the primary classification result, the occasional false judgment caused by overquick audio type switching is eliminated, and the operational complexity is also educed so that the correct and simple distinguishing of music and voice is realized.

Description

technical field [0001] The present invention relates to a device for distinguishing whether an audio signal is speech or music before audio coding and its implementation method, more specifically, an audio classification device based on reclassification and its implementation method. Background technique [0002] [001] Voice and music are two main types of audio data, and the classification of voice and music is one of the important means to extract audio structure and content semantics. In addition to limited registration information such as sampling rate, quantization accuracy, and encoding method, original audio data itself is only a non-semantic symbol representation and unstructured binary code stream, lacking content semantic description and structured organization. How to extract the structured information and content semantics in the audio, so that the unordered audio data becomes orderly, is the key to the practicality of content-based audio retrieval technology. C...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L11/00G10L15/08
Inventor 张培闫建新
Owner 数维科技(北京)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products