Method, integrated with multiple types of end-to-end neural network structures, for cold symptoms of speaker

A neural network and network structure technology, applied in speech analysis, instruments, etc., can solve problems such as mismatch between features and models, difficulty in training, and difficulty in finding features, achieving wide application prospects and a simple and fast recognition process.

Inactive Publication Date: 2017-08-18
SYSU CMU SHUNDE INT JOINT RES INST +1
View PDF3 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention provides a language that integrates multiple end-to-end neural network structures in order to solve the problems of feature extraction and pattern classification that are caused by the separation of feature extraction and pattern classification in the prior art. Human cold symptom recognition method, this method makes the entire speaker cold symptom recognition process simpler and faster by unifying feature learning and pattern classification, and has a wide range of application prospects

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, integrated with multiple types of end-to-end neural network structures, for cold symptoms of speaker
  • Method, integrated with multiple types of end-to-end neural network structures, for cold symptoms of speaker
  • Method, integrated with multiple types of end-to-end neural network structures, for cold symptoms of speaker

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] figure 1 The specific implementation process diagram of the method provided by the present invention, as figure 1 As shown, the speaker's cold symptom recognition method that fuses multiple end-to-end neural network structures provided by the present invention comprises the following steps:

[0029] S1. Construct and train an end-to-end neural network A in which the input is speech, and the recognition network is a convolutional neural network and a long-term short-term memory network;

[0030] S2. Construction and training input is speech spectrum, and the recognition network is an end-to-end neural network B of convolutional neural network and long-term short-term memory network;

[0031] S3. Construction and training input is speech spectrum, and the recognition network is an end-to-end neural network C of convolutional neural network and fully connected network;

[0032] S4. Construction and training The input is the voice MFCC feature / CQCC feature, and the recogn...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method, integrated with multiple types of end-to-end neural network structures, for cold symptoms of a speaker, and the method comprises the following steps: S1, constructing and training inputted voice, wherein a recognition network is an end-to-end neural network A of a convolution neural network and a long-short memory network; S2, constructing and training an inputted voice spectrum, wherein the recognition network is an end-to-end neural network B of the convolution neural network and the long-short memory network; S3, constructing and training the inputted voice spectrum, wherein the recognition network is an end-to-end neural network C of the convolution neural network and a full-connection network; S4, constructing and training the inputted voice MFCC features / CQCC features, wherein the recognition network is an end-to-end neural network D of the long-short memory network; S5, integrating the above four types of trained end-to-end neural networks, and carrying out the recognition of the cold symptoms of the speaker.

Description

technical field [0001] The present invention relates to the field of voiceprint recognition, and more specifically, relates to a speaker's cold symptom recognition method that integrates multiple end-to-end neural network structures. Background technique [0002] Speaker recognition, also known as voiceprint recognition, is a technology that uses pattern recognition technology to automatically identify speakers. The current speaker recognition technology has achieved good performance in experimental conditions, but in practice, the recognized speech will be affected by environmental noise and the speaker's health conditions, which reduces the robustness of the existing speaker recognition technology. Existing speaker recognition methods are mainly used to determine the speaker's identity, and there is no relevant recognition method applied to the speaker's cold symptoms at present. [0003] In the research of speech technology, researchers always hope to find the features t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/66G10L25/30G10L25/24
CPCG10L25/24G10L25/30G10L25/66
Inventor 李明倪志东
Owner SYSU CMU SHUNDE INT JOINT RES INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products