A Voice Separation and Tracking Method for Public Security Criminal Investigation Monitoring

A voice separation and criminal investigation technology, applied in voice analysis, instrumentation, computing, etc., can solve the problems of multi-microphone nonlinear combination configuration stability, high time complexity of training and testing, and inability to achieve end-to-end voice tracking, etc. Achieve the effect of real-time speech separation and tracking, reduce delay, and reduce generalization error
CN110197665BActive Publication Date: 2021-07-09GUANGDONG UNIV OF TECH

Patent Information

Authority / Receiving Office
CN Β· China
Patent Type
Patents(China)
Current Assignee / Owner
GUANGDONG UNIV OF TECH
Publication Date
2021-07-09

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention relates to the technical field of voice signal recognition and processing, and proposes a voice separation and tracking method for public security criminal investigation and monitoring, comprising the following steps: importing initial voice according to time sequence, performing frame-by-frame windowing processing on the initial voice, and obtaining a windowed voice signal Carry out time-frequency decomposition to windowed speech signal, obtain time-frequency two-dimensional signal by short-time Fourier transform; Carry out endpoint detection in frequency domain to described time-frequency two-dimensional signal, the speech signal segment corresponding to empty speech segment Perform filtering processing; use the two-way long-short-term memory network structure to perform speech separation on the time-frequency two-dimensional signal that has completed the filtering process, and output multiple voice waveforms of the target speaker; establish and train a target speaker model based on GMM-UBM, and convert all The speech waveform of the target speaker is used as the model input, and the GMM model of the target speaker is acquired through adaptively, and then the speech waveform is recognized, and the serial number of the target speaker is output, which is the result of speech tracking.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of voice signal recognition and processing, and more specifically, to a voice separation and tracking method for public security criminal investigation monitoring. Background technique

[0002] In the field of public security criminal investigation and monitoring, it is difficult to obtain relevant important information for the audio clip because the obtained audio clip contains related interference factors such as background noise, multiple speakers, and reverberation. Therefore, in the process of processing the speech signal, it is necessary to separate the speech signals of multiple speakers before processing them separately. At the same time, due to the particularity of criminal investigation monitoring, the voice signals of multiple speakers are collected by the same pickup, so it is difficult to separate and process the voice signals of multiple speakers. In addition, in the actual monitoring process ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More