Role separation method and device based on voice

A separation method and separation device technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problem of low accuracy of role separation technology

Active Publication Date: 2017-05-17
ALIBABA GRP HLDG LTD
View PDF19 Cites 41 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present application provides a voice-based role separation method and device

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Role separation method and device based on voice
  • Role separation method and device based on voice
  • Role separation method and device based on voice

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0094] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementations disclosed below.

[0095] In this application, a speech-based role separation method and a speech-based role separation apparatus are respectively provided, which will be described in detail in the following embodiments. For ease of understanding, before describing the embodiments, a brief description of the technical background, technical solutions, and writing methods of the embodiments of the present application will be made.

[0096] Existing role separation technologies applied in the field of speech usually use GMM (Gaussian mix...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The application discloses a role separation method based on voice, comprising the following steps: extracting feature vectors frame by frame from a voice signal to get a feature vector sequence; assigning role tags to the feature vectors; using the feature vectors with role tags to train a deep neural network DNN model; and judging a role sequence corresponding to the feature vector sequence according to the DNN model and a hidden Markov model HMM trained using the feature vectors, and outputting a role separation result, wherein the DNN model is used to output the probability of each role according to an input feature vector, and HMM is used to describe the jump relationship between roles. The invention further provides a role separation device based on voice. According to the method provided by the invention, as the DNN model with strong ability of feature extraction is used to model speaker roles, the method has stronger characterization ability than a traditional GMM (Gaussian Mixture Model), role characterization is more detailed and accurate, and therefore, a more accurate role separation result can be obtained.

Description

technical field [0001] The present application relates to the field of speech recognition, in particular to a speech-based role separation method. The present application also relates to a voice-based role separation device. Background technique [0002] Speech is the most natural way for humans to communicate, and speech recognition technology is a technology that allows machines to convert speech signals into corresponding text or commands through the process of recognition and understanding. Speech recognition is an interdisciplinary subject, and the fields involved include: signal processing, pattern recognition, probability theory and information theory, vocal mechanism and auditory mechanism, artificial intelligence, etc. [0003] In practical applications, in order to analyze speech signals more accurately, not only speech recognition is required, but also the speaker of each speech must be identified. Therefore, there is a natural need to separate speech by role. I...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/02G10L15/14G10L15/18G10L17/00
CPCG10L15/02G10L15/14G10L15/144G10L15/18G10L17/00G10L25/12G10L25/24
Inventor 李晓辉李宏言
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products