Role separation method and device based on voice

A role separation method and device, applicable to speech analysis, speech recognition, instruments, etc., that addresses the low accuracy of existing role separation technology.

Active Publication Date: 2017-05-17

ALIBABA GRP HLDG LTD


Problems solved by technology

[0005] The embodiments of the present application provide a voice-based role separation method and device to solve the problem that existing GMM- and HMM-based role separation technologies have relatively low accuracy.

Embodiment Construction

[0094] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many ways other than those described here, and those skilled in the art can make similar generalizations without departing from the spirit of the present application. Therefore, the present application is not limited to the specific implementations disclosed below.

[0095] This application provides a speech-based role separation method and a speech-based role separation apparatus, each of which is described in detail in the following embodiments. For ease of understanding, the technical background, the technical solution, and the way the embodiments are presented are briefly described before the embodiments themselves.

[0096] Existing role separation technologies applied in the field of speech usually use GMM (Gaussian mix...

Abstract

The application discloses a voice-based role separation method comprising the following steps: extracting feature vectors frame by frame from a voice signal to obtain a feature vector sequence; assigning role tags to the feature vectors; training a deep neural network (DNN) model with the role-tagged feature vectors; and determining the role sequence corresponding to the feature vector sequence according to the DNN model and a hidden Markov model (HMM) trained with the feature vectors, then outputting the role separation result. The DNN model outputs the probability of each role for an input feature vector, and the HMM describes the transition relationship between roles. The application further provides a voice-based role separation device. Because the method models speaker roles with a DNN, which has strong feature extraction ability, it characterizes roles more finely and accurately than a traditional GMM (Gaussian Mixture Model) and can therefore produce a more accurate role separation result.
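The decoding step in the abstract, where per-frame role probabilities from the DNN are combined with HMM role transitions to pick a role sequence, can be illustrated with a Viterbi search. The following is only a minimal sketch under stated assumptions, not the patent's implementation: the function name `viterbi_roles`, the two-role setup, and all probability values are hypothetical, the per-frame probabilities are hand-written stand-ins for DNN output, and they are used directly as emission scores without any prior normalization.

```python
import math


def viterbi_roles(frame_probs, trans, init):
    """Return the most likely role index for every frame.

    frame_probs[t][r]: probability of role r at frame t (stand-in for DNN output)
    trans[i][j]:       probability of moving from role i to role j (HMM transitions)
    init[r]:           probability of starting in role r
    All names and values here are illustrative assumptions.
    """
    def log(p):
        # Work in the log domain to avoid underflow on long sequences.
        return math.log(p) if p > 0 else float("-inf")

    n_roles = len(init)
    # Best log score of any path ending in each role at the first frame.
    score = [log(init[r]) + log(frame_probs[0][r]) for r in range(n_roles)]
    back = []  # back[t][j]: best previous role when frame t+1 is role j

    for probs in frame_probs[1:]:
        new_score, pointers = [], []
        for j in range(n_roles):
            best_i = max(range(n_roles),
                         key=lambda i: score[i] + log(trans[i][j]))
            pointers.append(best_i)
            new_score.append(score[best_i] + log(trans[best_i][j]) + log(probs[j]))
        score = new_score
        back.append(pointers)

    # Trace the best path backwards from the best final role.
    state = max(range(n_roles), key=lambda r: score[r])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Hypothetical two-role example: frame 2 weakly favors role 1 in isolation.
frame_probs = [[0.9, 0.1], [0.8, 0.2], [0.45, 0.55],
               [0.9, 0.1], [0.2, 0.8], [0.1, 0.9]]
trans = [[0.9, 0.1], [0.1, 0.9]]  # roles tend to persist between frames
init = [0.5, 0.5]

print(viterbi_roles(frame_probs, trans, init))  # [0, 0, 0, 0, 1, 1]
```

Choosing the most probable role independently per frame would give [0, 0, 1, 0, 1, 1] for this input. The transition probabilities penalize the isolated switch at frame 2, so the decoded sequence changes role only once, which is the kind of smoothing the HMM contributes in this sketch.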

Description

Technical field

[0001] The present application relates to the field of speech recognition, in particular to a voice-based role separation method. The present application also relates to a voice-based role separation device.

Background

[0002] Speech is the most natural way for humans to communicate, and speech recognition is a technology that allows machines to convert speech signals into corresponding text or commands through a process of recognition and understanding. Speech recognition is an interdisciplinary subject that draws on signal processing, pattern recognition, probability theory and information theory, the mechanisms of speech production and hearing, artificial intelligence, and other fields.

[0003] In practical applications, analyzing speech signals more accurately requires not only speech recognition but also identification of the speaker of each speech segment. Therefore, there is a natural need to separate speech by role. I...

Application Information

IPC(8): G10L15/02, G10L15/14, G10L15/18, G10L17/00

CPC: G10L15/02, G10L15/14, G10L15/144, G10L15/18, G10L17/00, G10L25/12, G10L25/24

Inventor: 李晓辉, 李宏言

Owner: ALIBABA GRP HLDG LTD
