Method, integrated with multiple types of end-to-end neural network structures, for cold symptoms of speaker

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A neural network and network structure technology, applied in speech analysis, instruments, etc., can solve problems such as mismatch between features and models, difficulty in training, and difficulty in finding features, achieving wide application prospects and a simple and fast recognition process.

Inactive Publication Date: 2017-08-18

SYSU CMU SHUNDE INT JOINT RES INST +1

View PDF3 Cites 14 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The present invention provides a language that integrates multiple end-to-end neural network structures in order to solve the problems of feature extraction and pattern classification that are caused by the separation of feature extraction and pattern classification in the prior art. Human cold symptom recognition method, this method makes the entire speaker cold symptom recognition process simpler and faster by unifying feature learning and pattern classification, and has a wide range of application prospects

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0028] figure 1 The specific implementation process diagram of the method provided by the present invention, as figure 1 As shown, the speaker's cold symptom recognition method that fuses multiple end-to-end neural network structures provided by the present invention comprises the following steps:

[0029] S1. Construct and train an end-to-end neural network A in which the input is speech, and the recognition network is a convolutional neural network and a long-term short-term memory network;

[0030] S2. Construction and training input is speech spectrum, and the recognition network is an end-to-end neural network B of convolutional neural network and long-term short-term memory network;

[0031] S3. Construction and training input is speech spectrum, and the recognition network is an end-to-end neural network C of convolutional neural network and fully connected network;

[0032] S4. Construction and training The input is the voice MFCC feature / CQCC feature, and the recogn...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a method, integrated with multiple types of end-to-end neural network structures, for cold symptoms of a speaker, and the method comprises the following steps: S1, constructing and training inputted voice, wherein a recognition network is an end-to-end neural network A of a convolution neural network and a long-short memory network; S2, constructing and training an inputted voice spectrum, wherein the recognition network is an end-to-end neural network B of the convolution neural network and the long-short memory network; S3, constructing and training the inputted voice spectrum, wherein the recognition network is an end-to-end neural network C of the convolution neural network and a full-connection network; S4, constructing and training the inputted voice MFCC features / CQCC features, wherein the recognition network is an end-to-end neural network D of the long-short memory network; S5, integrating the above four types of trained end-to-end neural networks, and carrying out the recognition of the cold symptoms of the speaker.

Description

technical field [0001] The present invention relates to the field of voiceprint recognition, and more specifically, relates to a speaker's cold symptom recognition method that integrates multiple end-to-end neural network structures. Background technique [0002] Speaker recognition, also known as voiceprint recognition, is a technology that uses pattern recognition technology to automatically identify speakers. The current speaker recognition technology has achieved good performance in experimental conditions, but in practice, the recognized speech will be affected by environmental noise and the speaker's health conditions, which reduces the robustness of the existing speaker recognition technology. Existing speaker recognition methods are mainly used to determine the speaker's identity, and there is no relevant recognition method applied to the speaker's cold symptoms at present. [0003] In the research of speech technology, researchers always hope to find the features t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L25/66G10L25/30G10L25/24

CPCG10L25/24G10L25/30G10L25/66

Inventor 李明倪志东

Owner SYSU CMU SHUNDE INT JOINT RES INST

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method, integrated with multiple types of end-to-end neural network structures, for cold symptoms of speaker

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology