Active voice detection method and system based on noise scene recognition

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A technology of speech detection and scene recognition, which is applied in speech analysis, character and pattern recognition, instruments, etc., can solve problems such as classifier burden, and achieve the effect of ensuring accuracy and effective detection ability

Pending Publication Date: 2020-11-13

北京中电慧声科技有限公司 +1

View PDF0 Cites 6 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Supervised methods are mainly composed of feature extraction and classifier design. In terms of feature extraction, in order to effectively distinguish the acoustic characteristics of noise and speech signals, researchers currently extract high-dimensional features from different angles, such as energy features, process Zero rate features, Mel Frequency Cepstrum Coefficient (MFCC) features, fuzzy entropy features, autocorrelation coefficient features, wavelet coefficient features, etc., and combine multiple features to fuse multi-angle feature information, however , although these feature combinations have a certain ability to distinguish noise and speech under specific noise types, in practical applications, due to the time-varying nature of noise types, it is often difficult for feature combinations set for general situations to show stability in dynamic noise scenarios. The ability to distinguish, and the high dimensionality of features often brings a burden to the subsequent use of classifiers

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0056] Such as figure 1 As shown, the present embodiment discloses a method for detecting active speech based on noise scene recognition, comprising the following steps:

[0057] S1: Extracting preferred features oriented to a noise classification task from the audio signal, and inputting the preferred feature values into a noise type classifier to identify the noise type in the audio signal.

[0058] Such as figure 2 As shown, the noise type classifier is constructed by the following steps:

[0059] S1-1: building a noise signal library, the noise signal library includes multiple types of noise signals;

[0060] S1-2: Using a time-frequency domain signal processing method to extract feature values of multiple audio features of each noise signal in the noise signal library;

[0061] For the task of distinguishing noise types, in order to obtain the distinguishability information between different noise signals from multiple angles, the present invention extracts zero-c...

Embodiment 2

[0149] The active voice detection method based on noise scene recognition in Embodiment 1 can be realized by the following active voice detection system.

[0150] Such as Figure 10 Shown, a kind of active voice detection system based on noise scene recognition, including:

[0151] The first feature extraction unit is used to extract the preferred features for noise classification tasks from the audio signal;

[0152] Noise classification identification unit, for identifying the noise type in the audio signal by the noise type classifier according to the preferred feature oriented to the noise classification task;

[0153] The model selection unit is used to determine the preferred features and classifiers suitable for audio signal-oriented speech and noise classification tasks according to the noise type;

[0154] The second feature extraction unit is used to extract the feature value of the preferred feature for speech and noise classification tasks from the audio signal; ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an active voice detection method based on noise scene recognition, and the method comprises the steps: extracting an optimal feature facing a noise classification task from anaudio signal, inputting a feature value into a noise type classifier, so as to recognize a noise type in the audio signal, according to the noise type, determining a preferred feature and a classifiersuitable for the voice and noise classification task, extracting optimal features oriented to the voice and noise classification task from the audio signals, inputting the optimal features into a voice and noise classifier, and judging whether the voice signals exist in the audio signals or not. According to activity voice detection system based on noise scene recognition, before binary classification of noisy voice and noise signals is carried out, according to the method, the current noise type is detected and identified, the most distinctive feature combination is preferably selected for the specific noise type, model parameters can be designed for the specific noise type, and the effectiveness and stability of the performance of the whole detection process under different noise typesare ensured.

Description

technical field [0001] The invention relates to the technical field of voice data processing, in particular to a noise scene recognition-based active voice detection method and system. Background technique [0002] There are often pauses and intermittent phenomena in a segment of speech signal. These "silent" segments will be superimposed with environmental noise and do not contain effective speech information. This type of information occupies a large data transmission resource and interferes with the effect of speech signal processing. The goal of active voice detection (voice activity detection, VAD) technology is to detect real voice passages from the signal and remove these "silent" parts, thereby reducing the burden of subsequent voice signal processing. Therefore, active voice detection technology is widely used It is used in speech coding, speaker recognition, automatic speech recognition, abnormal sound detection and other systems. [0003] In view of the wide appl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L25/03G10L25/51G06K9/62

CPCG10L25/03G10L25/51G06F18/2321G06F18/24323

Inventor田野

Owner北京中电慧声科技有限公司

Active voice detection method and system based on noise scene recognition

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology