Speech separation and tracking method for public security criminal investigation and monitoring

A speech separation and criminal investigation technology, applied in speech analysis, instrument, character and pattern recognition, etc., can solve the problems of multi-microphone nonlinear combination configuration stability, inability to adapt, and high time complexity of training and testing
CN110197665AActive Publication Date: 2019-09-03GUANGDONG UNIV OF TECH

Patent Information

Authority / Receiving Office
CN Β· China
Current Assignee / Owner
GUANGDONG UNIV OF TECH
Publication Date
2019-09-03

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention relates to the technical field of speech signal recognition and processing, and provides a speech separation and tracking method for public security criminal investigation and monitoring. The speech separation and tracking method includes the following steps that initial speech is imported according to timing sequence, the initial speech is subjected to framing and windowing processing, and a windowed speech signal is obtained; the windowed speech signal is time-frequency decomposed, and a time-frequency two-dimensional signal is obtained by the short-time Fourier transform; an endpoint of the time-frequency two-dimensional signal is detected in a frequency domain, and a corresponding speech signal segment of an empty language segment is filtered; a bidirectional long and short time memory network structure is used for performing speech separation of the two filtered dimensional time-frequency signal, and a great deal of speech waveform of a target speaker are output; anda target speaker model based on GMM-UBM is established and trained, the speech waveform of the target speaker are taken as models and input, a GMM model of the target speaker is acquired through an adaptive method and the speech waveform are recognized, a sequence number of the target speaker is outputted, that is, a speech tracking result.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of voice signal recognition and processing, and more specifically, to a voice separation and tracking method for public security criminal investigation monitoring. Background technique

[0002] In the field of public security criminal investigation and monitoring, it is difficult to obtain relevant important information for the audio clip because the obtained audio clip contains related interference factors such as background noise, multiple speakers, and reverberation. Therefore, in the process of processing the speech signal, it is necessary to separate the speech signals of multiple speakers before processing them separately. At the same time, due to the particularity of criminal investigation monitoring, the voice signals of multiple speakers are collected by the same pickup, so it is difficult to separate and process the voice signals of multiple speakers. In addition, in the actual monitoring process ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More