Speech segmentation method and device

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of speech and speech segments, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of low accuracy and poor segmentation effect, and achieve the effect of improving accuracy and good effect.

Active Publication Date: 2017-05-31

PING AN TECH (SHENZHEN) CO LTD

View PDF3 Cites 19 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The traditional speech segmentation technology is based on the global background model and the Gaussian mixture model. Due to technical limitations, the segmentation accuracy of this speech segmentation method is not high, especially for the dialogues that frequently alternate and overlap. Difference

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0045] The principles and features of the present invention will be described below with reference to the accompanying drawings. The examples are only used to explain the present invention, but not to limit the scope of the present invention.

[0046] like figure 1 shown, figure 1 It is a schematic flowchart of an embodiment of a method for segmentation of speech according to the present invention, and the method for segmentation of speech includes the following steps:

[0047] Step S1, when receiving the mixed voice sent by the terminal, the automatic response system divides the mixed voice into a plurality of short voice segments, and marks each short voice segment with a corresponding speaker identifier;

[0048] This embodiment can be applied to an automatic answering system of a call center, such as an automatic answering system of an insurance call center, an automatic answering system of various customer service call centers, and the like. The automatic answering syst...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a speech segmentation method and a speech segmentation device. The speech segmentation method comprises the following steps: segmenting mixed speech into a plurality of short speech segments when the mixed speech, which is transmitted from a terminal, is received by virtue of an automatic response system, and labeling the various short speech segments with corresponding speaker identifiers; and establishing a vocal print model for the short speech segments corresponding to the various speaker identifiers by virtue of a time recurrent neural network, and regulating corresponding segmentation boundaries in the mixed speech on the basis of the vocal print model, so as to segment out effective speech segments corresponding to the various speaker identifiers. With the application of the method and the device provided by the invention, precision of speech segmentation can be effectively enhanced; and especially for speeches that conversations are frequently alternated and are overlapped, a relatively good effect of speech segmentation is achieved.

Description

technical field [0001] The present invention relates to the technical field of speech processing, and in particular, to a method and device for segmentation of speech. Background technique [0002] At present, many voices received by a call center are mixed with voices of multiple people. In this case, voice segmentation (speaker diarization) needs to be performed on the voices before further voice analysis is performed on the target voices. Speech segmentation refers to: in the field of speech processing, when the speech of multiple speakers is combined and recorded in one channel, the speech of each speaker in the signal is extracted separately. The traditional speech segmentation technology is based on the global background model and the Gaussian mixture model. Due to technical limitations, the segmentation accuracy of this speech segmentation method is not high, especially for frequent dialogues and overlapping dialogue segmentation effects. Difference. SUMMARY OF THE...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/04

CPCG10L15/04

Inventor王健宗郭卉肖京

OwnerPING AN TECH (SHENZHEN) CO LTD

Speech segmentation method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology