Speech separation and tracking method for public security criminal investigation and monitoring

A speech separation and criminal investigation technology, applied in speech analysis, instrument, character and pattern recognition, etc., can solve the problems of multi-microphone nonlinear combination configuration stability, inability to adapt, and high time complexity of training and testing

Active Publication Date: 2019-09-03
GUANGDONG UNIV OF TECH
View PDF15 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. Align and capture the position information of multiple target speakers through the combination of multiple microphone arrays, but this method has problems of nonlinear combination of multiple microphones and configuration stability;
[0005] 2. Use visual information as auxiliary information to enhance the performance of the speech separation and tracking system to separate and track speech signals. However, this method needs to combine speech information and visual information for simultaneous processing and analysis, and in practical applic

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech separation and tracking method for public security criminal investigation and monitoring
  • Speech separation and tracking method for public security criminal investigation and monitoring
  • Speech separation and tracking method for public security criminal investigation and monitoring

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0078] The accompanying drawings are for illustrative purposes only and cannot be construed as limiting the patent;

[0079] In order to better illustrate this embodiment, some parts in the drawings will be omitted, enlarged or reduced, and do not represent the size of the actual product;

[0080] For those skilled in the art, it is understandable that some well-known structures and descriptions thereof may be omitted in the drawings.

[0081] The technical solutions of the present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0082] Such as figure 1 As shown, it is a flow chart of a voice separation and tracking method for public security criminal investigation monitoring in this embodiment.

[0083] A kind of voice separation and tracking method that this embodiment proposes is used for public security criminal investigation monitoring, comprises the following steps:

[0084] S1. Import the initial speech accord...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of speech signal recognition and processing, and provides a speech separation and tracking method for public security criminal investigation and monitoring. The speech separation and tracking method includes the following steps that initial speech is imported according to timing sequence, the initial speech is subjected to framing and windowing processing, and a windowed speech signal is obtained; the windowed speech signal is time-frequency decomposed, and a time-frequency two-dimensional signal is obtained by the short-time Fourier transform; an endpoint of the time-frequency two-dimensional signal is detected in a frequency domain, and a corresponding speech signal segment of an empty language segment is filtered; a bidirectional long and short time memory network structure is used for performing speech separation of the two filtered dimensional time-frequency signal, and a great deal of speech waveform of a target speaker are output; anda target speaker model based on GMM-UBM is established and trained, the speech waveform of the target speaker are taken as models and input, a GMM model of the target speaker is acquired through an adaptive method and the speech waveform are recognized, a sequence number of the target speaker is outputted, that is, a speech tracking result.

Description

technical field [0001] The invention relates to the technical field of voice signal recognition and processing, and more specifically, to a voice separation and tracking method for public security criminal investigation monitoring. Background technique [0002] In the field of public security criminal investigation and monitoring, it is difficult to obtain relevant important information for the audio clip because the obtained audio clip contains related interference factors such as background noise, multiple speakers, and reverberation. Therefore, in the process of processing the speech signal, it is necessary to separate the speech signals of multiple speakers before processing them separately. At the same time, due to the particularity of criminal investigation monitoring, the voice signals of multiple speakers are collected by the same pickup, so it is difficult to separate and process the voice signals of multiple speakers. In addition, in the actual monitoring process ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L17/06G10L17/18G10L21/0272G10L25/78G06K9/62
CPCG10L17/06G10L17/18G10L21/0272G10L25/78G06F18/23213
Inventor 郝敏李扬刘航
Owner GUANGDONG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products