Speech separation and tracking method for public security criminal investigation and monitoring

A speech separation technology for criminal investigation, applied in speech analysis, instruments, and character and pattern recognition, that addresses the unstable configuration of nonlinear multi-microphone combinations, the inability to adapt, and the high time complexity of training and testing.

Active Publication Date: 2019-09-03

GUANGDONG UNIV OF TECH

15 Cites, 9 Cited by


Problems solved by technology

[0004] 1. Aligning and capturing the position information of multiple target speakers by combining multiple microphone arrays. However, this method suffers from the nonlinear combination of multiple microphones and from configuration-stability problems;

[0005] 2. Using visual information as auxiliary information to improve the performance of the speech separation and tracking system. However, this method must process and analyze speech and visual information simultaneously, and in practical applications the collected audio and images are subject to delay, so the method cannot adapt;

[0006] 3. Processing the speech signal by feeding an effective bit coding vector, or the speech information of the target speaker, into the speech separation system as an additional input. However, this method cannot achieve end-to-end speech tracking, and, compared with a standalone speech tracking algorithm, introducing the target speaker's identity information as input makes the time complexity of training and testing too high.


Embodiment Construction

[0078] The accompanying drawings are for illustrative purposes only and cannot be construed as limiting the patent;

[0079] In order to better illustrate this embodiment, some parts in the drawings will be omitted, enlarged or reduced, and do not represent the size of the actual product;

[0080] For those skilled in the art, it is understandable that some well-known structures and descriptions thereof may be omitted in the drawings.

[0081] The technical solutions of the present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0082] Figure 1 shows a flow chart of the voice separation and tracking method for public security criminal investigation monitoring in this embodiment.

[0083] The voice separation and tracking method proposed in this embodiment is used for public security criminal investigation monitoring and comprises the following steps:

[0084] S1. Import the initial speech accord...
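The abstract describes the front end of the method as framing and windowing the imported speech, then applying the short-time Fourier transform to obtain a two-dimensional time-frequency signal. The following is a minimal sketch of that idea in plain Python, not the patent's implementation. The Hamming window, the frame length of 32 samples, the hop of 16 samples, the 8 kHz sample rate, and the function names `frame_and_window` and `stft_magnitude` are all assumptions chosen for illustration; the source only says "framing and windowing".

```python
import cmath
import math


def frame_and_window(signal, frame_len, hop):
    """Split a sample sequence into overlapping frames and apply a Hamming window."""
    if frame_len < 2 or hop < 1:
        raise ValueError("frame_len must be >= 2 and hop >= 1")
    # Symmetric Hamming window (assumed; the source does not name the window).
    window = [0.54 - 0.46 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    frames = []
    # Only full frames are kept; trailing samples that do not fill a frame are dropped.
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        frames.append([s * w for s, w in zip(frame, window)])
    return frames


def stft_magnitude(frames):
    """Magnitude spectrum of each windowed frame (naive DFT, bins 0..N/2)."""
    spectra = []
    for frame in frames:
        n = len(frame)
        row = []
        for k in range(n // 2 + 1):
            acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                      for i, x in enumerate(frame))
            row.append(abs(acc))
        spectra.append(row)
    return spectra


# Toy input (assumed values): 10 ms of a 1 kHz tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(80)]
frames = frame_and_window(tone, frame_len=32, hop=16)
spectrogram = stft_magnitude(frames)  # rows are frames, columns are frequency bins
```

The naive DFT keeps the sketch dependency-free but costs O(N²) per frame; for real recordings an FFT-based routine such as `scipy.signal.stft` would be the more usual choice.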


Abstract

The invention relates to the technical field of speech signal recognition and processing, and provides a speech separation and tracking method for public security criminal investigation and monitoring. The method includes the following steps: initial speech is imported in time order and subjected to framing and windowing to obtain a windowed speech signal; the windowed speech signal is decomposed in time and frequency by the short-time Fourier transform to obtain a two-dimensional time-frequency signal; endpoint detection is performed on the time-frequency signal in the frequency domain, and the signal segments corresponding to empty (non-speech) segments are filtered out; a bidirectional long short-term memory (BiLSTM) network performs speech separation on the filtered two-dimensional time-frequency signal and outputs speech waveforms of multiple target speakers; and a target speaker model based on GMM-UBM is established and trained, with the target speakers' speech waveforms as model input, a GMM for each target speaker is obtained through an adaptive method, the speech waveforms are recognized, and the sequence number of the target speaker is output as the speech tracking result.
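The last step of the abstract, deriving a per-speaker GMM from a universal background model (UBM) and outputting a speaker sequence number, can be sketched as follows. This is an illustration under stated assumptions, not the patent's procedure: the abstract says only "an adaptive method", and mean-only MAP adaptation with a relevance factor is assumed here because it is a common choice for GMM-UBM systems. The one-dimensional toy features, the two-component UBM, the relevance value, and every function name are hypothetical.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))


def component_logs(x, weights, means, variances):
    """Log of weight times density for every mixture component."""
    return [math.log(w) + log_gauss_diag(x, m, v)
            for w, m, v in zip(weights, means, variances)]


def map_adapt_means(frames, weights, means, variances, relevance=16.0):
    """Mean-only MAP adaptation of the UBM towards one speaker's frames (assumed method)."""
    k, dim = len(means), len(means[0])
    counts = [0.0] * k
    sums = [[0.0] * dim for _ in range(k)]
    for x in frames:
        logs = component_logs(x, weights, means, variances)
        peak = max(logs)
        exps = [math.exp(v - peak) for v in logs]
        total = sum(exps)
        for c in range(k):
            post = exps[c] / total  # responsibility of component c for frame x
            counts[c] += post
            for d in range(dim):
                sums[c][d] += post * x[d]
    adapted = []
    for c in range(k):
        if counts[c] == 0.0:
            adapted.append(list(means[c]))  # no data seen: keep the UBM mean
            continue
        alpha = counts[c] / (counts[c] + relevance)
        adapted.append([alpha * sums[c][d] / counts[c] + (1 - alpha) * means[c][d]
                        for d in range(dim)])
    return adapted


def avg_log_likelihood(frames, weights, means, variances):
    """Average per-frame log-likelihood under a GMM (log-sum-exp for stability)."""
    total = 0.0
    for x in frames:
        logs = component_logs(x, weights, means, variances)
        peak = max(logs)
        total += peak + math.log(sum(math.exp(v - peak) for v in logs))
    return total / len(frames)


def identify(frames, speaker_means, weights, variances):
    """Return the index (sequence number) of the best-scoring speaker model."""
    scores = [avg_log_likelihood(frames, weights, m, variances) for m in speaker_means]
    return max(range(len(scores)), key=scores.__getitem__)


# Toy UBM and enrollment data (all values are illustrative assumptions).
ubm_w, ubm_mu, ubm_var = [0.5, 0.5], [[-1.0], [1.0]], [[1.0], [1.0]]
enroll = [
    [[-2.0], [-2.2], [-1.8], [0.4], [0.6]],   # separated waveform features, speaker 0
    [[-0.5], [-0.3], [2.0], [2.2], [1.8]],    # separated waveform features, speaker 1
]
models = [map_adapt_means(f, ubm_w, ubm_mu, ubm_var, relevance=2.0) for f in enroll]
tracked = identify([[-2.1], [-1.9], [0.5]], models, ubm_w, ubm_var)
```

Adapting only the means keeps the UBM weights and variances shared across speakers, so each speaker model needs little enrollment data; a real system would score multi-dimensional acoustic features rather than the scalar values used here.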

Description

Technical field

[0001] The invention relates to the technical field of voice signal recognition and processing, and more specifically to a voice separation and tracking method for public security criminal investigation monitoring.

Background

[0002] In the field of public security criminal investigation and monitoring, it is difficult to extract important information from an audio clip because the clip contains interference such as background noise, multiple speakers, and reverberation. When processing the speech signal, the speech of the individual speakers must therefore be separated first and then processed separately. At the same time, owing to the particular nature of criminal investigation monitoring, the voice signals of multiple speakers are collected by the same pickup, which makes them difficult to separate and process. In addition, in the actual monitoring process ...


Application Information


IPC(8): G10L17/06, G10L17/18, G10L21/0272, G10L25/78, G06K9/62

CPC: G10L17/06, G10L17/18, G10L21/0272, G10L25/78, G06F18/23213

Inventors: 郝敏, 李扬, 刘航

Owner: GUANGDONG UNIV OF TECH
