Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Sound activity detecting method and detector thereof

A sound activity detection and detector technology, applied in the field of communication, can solve problems such as independence, poor versatility and maintainability, high cost of porting between codes, insufficient description of audio signal characteristics, etc., to achieve the effect of convenient maintenance and update

Inactive Publication Date: 2008-06-11
HUAWEI TECH CO LTD
View PDF0 Cites 80 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] It can be seen from the prior art that the prior art detects the music signal on the basis of the VAD technology in the existing speech coding standard, so it is closely related to the encoding algorithm, that is, the coupling with the encoder itself is too large, and the independence , Versatility and maintainability are generally poor, and the cost of porting between codes is high
[0007] In addition, the existing VAD algorithms are all developed for speech signals, so they only divide the input audio signal into two types: noise and speech (non-noise). Corrections and additions
Therefore, as the application scenario of the codec algorithm gradually transitions from processing voice to processing multimedia voice (including multimedia music), the codec algorithm itself gradually expands from narrowband to broadband. Therefore, with the change of application scenarios, the existing VAD The simple output class of the algorithm is obviously not enough to describe a wide variety of audio signal characteristics

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sound activity detecting method and detector thereof
  • Sound activity detecting method and detector thereof
  • Sound activity detecting method and detector thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Since the speech signal, noise signal and music signal have different distribution characteristics in the frequency spectrum, and the frame-to-frame changes of the speech, music and noise sequences also have their own characteristics. The embodiment of the present invention considers first extracting the characteristic parameters of various audio signals based on the characteristics of these signal frames, and then performing primary classification on the input narrowband audio or wideband audio digital signal frames according to these specific parameters, and classifying the input signals into non-noise Signal frame (that is, useful signal, including speech and music), noise frame, and silent signal frame. Then the signal frames judged as non-noise are further divided into voiced sound, unvoiced sound and music signal frames.

[0023] The first embodiment provided by the present invention is a sound activity detector (General Sound Activity Detection, GSAD), its struct...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a sound activation detecting method and a sound activation detector, the core of which is: extracting the feature parameters of the current signal frame when the sound activation detection is needed; and determining the sound type of the current signal frame according to the feature parameters and the set parameter threshold. By the invention, the specific coding algorithm is not relied on when the used feature parameters in the classifying process are extracted, thus being convenient for maintenance and updating, and classifying the input signals into more sound types. When being used in the sound coding technical field, the invention can not only be used as new-opened variable rate sound frequency coding algorithm and standard rate selection foundation, but also provide foundation of rate selection for prior variable rate voice or sound frequency coding standard without VAD algorithm. The invention can be applicable to voice boosting, voice recognition, recognition of spoken person and other voice signal processing fields with strong commonality.

Description

technical field [0001] The invention relates to the communication field, in particular to the voice signal processing technology. Background technique [0002] In the field of speech signal processing, there is a technology for detecting voice activity. When it is applied in speech coding technology, it is called voice activity detection (Voice Activity Detection, VAD). When it is applied in speech recognition technology, it is usually It is called speech endpoint detection (Speech EndpointDetection), and when it is applied in speech enhancement technology, it is usually called speech gap detection (SpeechPause Detection). For different application scenarios, these technologies will have different emphases and produce different processing results. However, their essence is to detect whether there is voice during voice communication, and the accuracy of the detection result directly affects the quality of subsequent processing (such as voice coding, voice recognition and enh...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L11/00G10L11/02G10L11/06G10L15/00G10L15/02G10L15/04G10L19/00G10L19/12G10L19/14G10L21/02G10L25/78
CPCG10L25/78
Inventor 严勤邓浩江王珺曾学文张军张立斌
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products