Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Audio detecting and classifying method with customization function

A technology of audio detection and classification methods, applied in speech analysis, speech recognition, instruments, etc., which can solve the problem of not being able to customize multiple categories and make judgments

Active Publication Date: 2014-05-28
北京华控智加科技有限公司
View PDF6 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] In order to overcome the above-mentioned shortcoming of prior art, the object of the present invention is to provide a kind of audio frequency detection and classification method with self-defining function, at first part original training set is divided into several kinds of training sets according to type, and feature extraction is carried out for each type of training set , and train the corresponding Gaussian mixture model and its parameters to obtain a global Gaussian mixture model; further use other training sets as new training samples, update the parameters of the global Gaussian mixture model to obtain a local model; finally extract features from the test set , input the local model classifier, and smooth and output the result, its main advantage is to overcome the problem that the original audio activation detection cannot customize multiple categories and make judgments

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio detecting and classifying method with customization function
  • Audio detecting and classifying method with customization function
  • Audio detecting and classifying method with customization function

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The implementation of the present invention will be described in detail below in conjunction with the drawings and examples.

[0040] figure 1 For the global model training flowchart of the audio detection classification of the present invention, include the following:

[0041] The present invention proposes a global model training method and device based on audio detection and classification, especially for the scenario of audio activation detection and classification. These methods and devices are not limited to audio activity detection classification, and may be any method and device related to audio classification.

[0042] figure 1 An example of global model training based on audio detection classification is described.

[0043] like figure 1 The first type of training samples 101 shown include all the first type of audio signals for training, the second type of training samples 102 include all of the second type of audio signals for training, and so on, the Mt...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an audio detecting and classifying method with the customization function. According to the audio detecting and classifying method, audio activated detection is conducted on audio data; firstly, a part of original training sets are classified into a plurality of types of training sets according to the types, feature extraction is conducted on each type of training sets, and a Gaussian hybrid model corresponding to each type of training sets and parameters of the Gaussian hybrid model are trained, so that an overall Gaussian hybrid model is obtained; secondly, the other training sets are used as new training samples, and parameter updating is conducted on the overall Gaussian hybrid model so that a local model can be obtained; finally, features of a test set are extracted, a local model classifier is input, and a result is smoothed and output. According to the audio detecting and classifying method with the customization function, through training of the overall Gaussian hybrid model and the training of the local Gaussian hybrid model, the types and the parameters of the Gaussian hybrid models can be updated along with the increase of the number of the samples; through the combination of the audio detecting and classifying method and the classifier, the performance of a system is further improved, and audio detection and classification are achieved finally; the audio detecting and classifying method with the customization function can be widely applied to multiple machine learning fields, such as speaker recognition, voice recognition and human-computer interaction, relating to audio detection and classification.

Description

technical field [0001] The invention belongs to the technical field of audio processing, in particular to an audio detection and classification method with self-defining functions. Background technique [0002] In systems such as audio recognition and speaker recognition, audio activity detection (Voice activity detection, VAD) technology is widely used. It is mainly used to exclude silence and noise signals that are not related to speakers in continuous audio signals, determine the starting point of audio segments and End position to improve the performance of speech recognition and speaker recognition systems. Effective and accurate audio activation detection can improve system recognition performance by removing noise or silent segment signals, reducing system data processing and interference with subsequent audio analysis and processing. Research on audio activation detection algorithms has been carried out for many years. Traditional audio activation detection methods ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/14G10L15/20
CPCG10L25/78G10L25/24G10L25/51
Inventor 杨毅刘加
Owner 北京华控智加科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products