Gain processing method and device for speech recognition system

A speech recognition processing technology, applied in speech recognition, speech analysis, instruments, etc., that addresses problems such as performance degradation of the recognition system and improves its robustness.
CN105355197B · Active · Publication Date: 2020-01-07 · BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

Patent Information

Authority / Receiving Office: CN · China
Patent Type: Patents (China)
Current Assignee / Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD
Publication Date: 2020-01-07

Abstract

The application provides a gain processing method and device for a speech recognition system. The method comprises: dividing input first audio data of a preset frame length into audio sections of a preset division length and acquiring the peak value of each audio section; acquiring a block gain for each audio section from its peak value and a preset expected audio amplitude, where the expected audio amplitude matches the training data of the speech recognition system; sorting all block gains from small to large, selecting a preset number M of them, and applying median filtering to obtain the expected gain of the first audio data; and adjusting the amplitude of the first audio data by the expected gain. This automatically adjusts the gain of the audio data so that the amplitude of the received audio signal exceeds the threshold of the speech recognition system and matches the training data, which improves the stability of the speech recognition system.
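
The steps in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names, the block gain taken as the ratio of expected amplitude to section peak, the reading of the median filtering step as the median of the M smallest block gains, and the 16-bit clipping limit are all assumptions made for the example.

```python
from statistics import median


def expected_gain(samples, section_len, target_amplitude, m):
    """Estimate one gain for a frame of audio samples (hypothetical helper)."""
    # Split the frame into consecutive sections of the preset division length.
    sections = [samples[i:i + section_len]
                for i in range(0, len(samples), section_len)]
    # Peak value of each section: the largest absolute sample.
    peaks = [max(abs(s) for s in section) for section in sections]
    # Assumed block gain: expected amplitude divided by the section peak.
    # Silent sections (peak 0) are skipped to avoid division by zero.
    gains = [target_amplitude / p for p in peaks if p > 0]
    if not gains:
        return 1.0  # nothing to scale, leave the frame unchanged
    # Take the M smallest block gains and use their median as the frame gain.
    return median(sorted(gains)[:m])


def apply_gain(samples, gain, limit=32767):
    """Scale the samples and clip to an assumed 16-bit range."""
    return [max(-limit, min(limit, round(s * gain))) for s in samples]


frame = [100, -200, 50, 400, -1000, 300, 0, 0]
gain = expected_gain(frame, section_len=2, target_amplitude=8000, m=3)
adjusted = apply_gain(frame, gain)
```

Preferring the smallest block gains keeps the loudest sections in control of the result, so a quiet or silent section cannot push the frame gain high enough to clip the loud ones.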

Description

Technical field

[0001] The present application relates to the technical field of speech recognition processing, in particular to a gain processing method and device for a speech recognition system.

Background

[0002] With the development of speech recognition technology, speech recognition systems are being applied in more and more fields. Existing speech recognition systems usually train a general-purpose recognition model on massive amounts of audio data.

[0003] However, when a speech recognition system is used in practice, the statistical characteristics of the audio data to be recognized inevitably differ from those of the training data, and this mismatch shows up most clearly in the amplitude of the audio signal. In addition, speech recognition systems generally require the audio amplitude received by the microphone to be above a certain threshold, and once the audio amplitude is lower than the threshold, the performance of...

Claims
