Speech-based role separation method and device

A separation method and role technology, applied in voice analysis, speech recognition, instruments, etc., can solve the problem of low accuracy of role separation technology

Active Publication Date: 2021-02-05
ALIBABA GRP HLDG LTD
View PDF19 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present application provides a voice-based role separation method and device to solve the problem of relatively low accuracy of existing GMM and HMM-based role separation technologies

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech-based role separation method and device
  • Speech-based role separation method and device
  • Speech-based role separation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0094] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementations disclosed below.

[0095] In this application, a speech-based role separation method and a speech-based role separation apparatus are respectively provided, which will be described in detail in the following embodiments. For ease of understanding, before describing the embodiments, a brief description of the technical background, technical solutions, and writing methods of the embodiments of the present application will be made.

[0096] Existing role separation technologies applied in the field of speech usually use GMM (Gaussian mix...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The application discloses a voice-based role separation method, including: extracting feature vectors frame by frame from voice signals to obtain feature vector sequences; assigning role labels to feature vectors; using feature vectors with role labels to train deep neural network DNN models ;According to the DNN model and the hidden Markov model HMM obtained by using the feature vector training, determine the role sequence corresponding to the feature vector sequence, and output the role separation result; wherein, the DNN model is used to output the corresponding feature vector according to the input The probability of each role, HMM is used to describe the jump relationship between roles. The present application also provides a voice-based role separation device. The above-mentioned method provided by this application adopts the DNN model with strong feature extraction ability to model the speaker role, which has a stronger ability to describe the speaker than the traditional GMM, and the description of the role is more refined and accurate, so it can obtain More accurate role separation results.

Description

technical field [0001] The present application relates to the field of speech recognition, in particular to a speech-based role separation method. The present application also relates to a voice-based role separation device. Background technique [0002] Speech is the most natural way for humans to communicate, and speech recognition technology is a technology that allows machines to convert speech signals into corresponding text or commands through the process of recognition and understanding. Speech recognition is an interdisciplinary subject, and the fields involved include: signal processing, pattern recognition, probability theory and information theory, vocal mechanism and auditory mechanism, artificial intelligence, etc. [0003] In practical applications, in order to analyze speech signals more accurately, not only speech recognition is required, but also the speaker of each speech must be identified. Therefore, there is a natural need to separate speech by role. I...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/02G10L15/14G10L15/18G10L17/00
CPCG10L15/02G10L15/14G10L15/144G10L15/18G10L17/00G10L25/12G10L25/24
Inventor 李晓辉李宏言
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products