Active voice detection method and system based on noise scene recognition
A technology of speech detection and scene recognition, which is applied in speech analysis, character and pattern recognition, instruments, etc., can solve problems such as classifier burden, and achieve the effect of ensuring accuracy and effective detection ability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0056] Such as figure 1 As shown, the present embodiment discloses a method for detecting active speech based on noise scene recognition, comprising the following steps:
[0057] S1: Extracting preferred features oriented to a noise classification task from the audio signal, and inputting the preferred feature values into a noise type classifier to identify the noise type in the audio signal.
[0058] Such as figure 2 As shown, the noise type classifier is constructed by the following steps:
[0059] S1-1: building a noise signal library, the noise signal library includes multiple types of noise signals;
[0060] S1-2: Using a time-frequency domain signal processing method to extract feature values of multiple audio features of each noise signal in the noise signal library;
[0061] For the task of distinguishing noise types, in order to obtain the distinguishability information between different noise signals from multiple angles, the present invention extracts zero-c...
Embodiment 2
[0149] The active voice detection method based on noise scene recognition in Embodiment 1 can be realized by the following active voice detection system.
[0150] Such as Figure 10 Shown, a kind of active voice detection system based on noise scene recognition, including:
[0151] The first feature extraction unit is used to extract the preferred features for noise classification tasks from the audio signal;
[0152] Noise classification identification unit, for identifying the noise type in the audio signal by the noise type classifier according to the preferred feature oriented to the noise classification task;
[0153] The model selection unit is used to determine the preferred features and classifiers suitable for audio signal-oriented speech and noise classification tasks according to the noise type;
[0154] The second feature extraction unit is used to extract the feature value of the preferred feature for speech and noise classification tasks from the audio signal; ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com