Speech-based role separation method and device

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A separation method and role technology, applied in voice analysis, speech recognition, instruments, etc., can solve the problem of low accuracy of role separation technology

Active Publication Date: 2021-02-05

ALIBABA GRP HLDG LTD

View PDF19 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The embodiment of the present application provides a voice-based role separation method and device to solve the problem of relatively low accuracy of existing GMM and HMM-based role separation technologies

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0094] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementations disclosed below.

[0095] In this application, a speech-based role separation method and a speech-based role separation apparatus are respectively provided, which will be described in detail in the following embodiments. For ease of understanding, before describing the embodiments, a brief description of the technical background, technical solutions, and writing methods of the embodiments of the present application will be made.

[0096] Existing role separation technologies applied in the field of speech usually use GMM (Gaussian mix...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The application discloses a voice-based role separation method, including: extracting feature vectors frame by frame from voice signals to obtain feature vector sequences; assigning role labels to feature vectors; using feature vectors with role labels to train deep neural network DNN models ;According to the DNN model and the hidden Markov model HMM obtained by using the feature vector training, determine the role sequence corresponding to the feature vector sequence, and output the role separation result; wherein, the DNN model is used to output the corresponding feature vector according to the input The probability of each role, HMM is used to describe the jump relationship between roles. The present application also provides a voice-based role separation device. The above-mentioned method provided by this application adopts the DNN model with strong feature extraction ability to model the speaker role, which has a stronger ability to describe the speaker than the traditional GMM, and the description of the role is more refined and accurate, so it can obtain More accurate role separation results.

Description

technical field [0001] The present application relates to the field of speech recognition, in particular to a speech-based role separation method. The present application also relates to a voice-based role separation device. Background technique [0002] Speech is the most natural way for humans to communicate, and speech recognition technology is a technology that allows machines to convert speech signals into corresponding text or commands through the process of recognition and understanding. Speech recognition is an interdisciplinary subject, and the fields involved include: signal processing, pattern recognition, probability theory and information theory, vocal mechanism and auditory mechanism, artificial intelligence, etc. [0003] In practical applications, in order to analyze speech signals more accurately, not only speech recognition is required, but also the speaker of each speech must be identified. Therefore, there is a natural need to separate speech by role. I...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L15/02G10L15/14G10L15/18G10L17/00

CPCG10L15/02G10L15/14G10L15/144G10L15/18G10L17/00G10L25/12G10L25/24

Inventor李晓辉李宏言

OwnerALIBABA GRP HLDG LTD

Speech-based role separation method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology