Speech segmentation method and device

A technology of speech and speech segments, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of low accuracy and poor segmentation effect, and achieve the effect of improving accuracy and good effect.

Active Publication Date: 2017-05-31
PING AN TECH (SHENZHEN) CO LTD
View PDF3 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional speech segmentation technology is based on the global background model and the Gaussian mixture model. Due to technical limitations, th

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech segmentation method and device
  • Speech segmentation method and device
  • Speech segmentation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The principles and features of the present invention will be described below with reference to the accompanying drawings. The examples are only used to explain the present invention, but not to limit the scope of the present invention.

[0046] like figure 1 shown, figure 1 It is a schematic flowchart of an embodiment of a method for segmentation of speech according to the present invention, and the method for segmentation of speech includes the following steps:

[0047] Step S1, when receiving the mixed voice sent by the terminal, the automatic response system divides the mixed voice into a plurality of short voice segments, and marks each short voice segment with a corresponding speaker identifier;

[0048] This embodiment can be applied to an automatic answering system of a call center, such as an automatic answering system of an insurance call center, an automatic answering system of various customer service call centers, and the like. The automatic answering syst...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a speech segmentation method and a speech segmentation device. The speech segmentation method comprises the following steps: segmenting mixed speech into a plurality of short speech segments when the mixed speech, which is transmitted from a terminal, is received by virtue of an automatic response system, and labeling the various short speech segments with corresponding speaker identifiers; and establishing a vocal print model for the short speech segments corresponding to the various speaker identifiers by virtue of a time recurrent neural network, and regulating corresponding segmentation boundaries in the mixed speech on the basis of the vocal print model, so as to segment out effective speech segments corresponding to the various speaker identifiers. With the application of the method and the device provided by the invention, precision of speech segmentation can be effectively enhanced; and especially for speeches that conversations are frequently alternated and are overlapped, a relatively good effect of speech segmentation is achieved.

Description

technical field [0001] The present invention relates to the technical field of speech processing, and in particular, to a method and device for segmentation of speech. Background technique [0002] At present, many voices received by a call center are mixed with voices of multiple people. In this case, voice segmentation (speaker diarization) needs to be performed on the voices before further voice analysis is performed on the target voices. Speech segmentation refers to: in the field of speech processing, when the speech of multiple speakers is combined and recorded in one channel, the speech of each speaker in the signal is extracted separately. The traditional speech segmentation technology is based on the global background model and the Gaussian mixture model. Due to technical limitations, the segmentation accuracy of this speech segmentation method is not high, especially for frequent dialogues and overlapping dialogue segmentation effects. Difference. SUMMARY OF THE...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/04
CPCG10L15/04
Inventor 王健宗郭卉肖京
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products