Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Statistical-acoustic-model adaptation method, acoustic-model learning method suitable for statistical-acoustic-model adaptation, storage medium in which parameters for building deep neural network are stored, and computer program for adapting statistical acoustic model

一种统计声学模型、深度神经网络的技术,应用在深度神经网络领域,能够解决学习需要很长时间等问题

Inactive Publication Date: 2016-07-06
NAT INST OF INFORMATION & COMM TECH
View PDF13 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, learning can still take a long time when large amounts of data are used for learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Statistical-acoustic-model adaptation method, acoustic-model learning method suitable for statistical-acoustic-model adaptation, storage medium in which parameters for building deep neural network are stored, and computer program for adapting statistical acoustic model
  • Statistical-acoustic-model adaptation method, acoustic-model learning method suitable for statistical-acoustic-model adaptation, storage medium in which parameters for building deep neural network are stored, and computer program for adapting statistical acoustic model
  • Statistical-acoustic-model adaptation method, acoustic-model learning method suitable for statistical-acoustic-model adaptation, storage medium in which parameters for building deep neural network are stored, and computer program for adapting statistical acoustic model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In the following description and drawings, the same reference numerals are attached to the same components. Therefore, detailed descriptions for these components will not be repeated. In addition, the following embodiments are mainly examples related to adaptation under the condition of a specific speaker in voice recognition, but the present invention is not limited to such embodiments. For example, it can also be applied to adaptation to conditions such as a noisy environment.

[0039] [structure]

[0040] As described above, when speaker adaptation is performed using an acoustic model using DNN, after DNN learning is performed using voice data of an unspecified speaker, only specific layers need to use the voice data of the speaker to be adapted to learn. At this time, the parameters of layers other than this layer are fixed, and learning is not performed. However, this often has the problem that the sound data is insufficient and learning for adaptation takes a ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a statistical-acoustic-model adaptation method with which learning data having specific conditions can be used to efficiently adapt acoustic models using deep neural networks (DNN), and with which accuracy can also be improved. A speaker adaptation method for acoustic models using DNNs includes: a step in which utterance data (90-98) for different speakers is separately stored in first storage devices; a step in which hidden layer modules (112-120) for separate speakers are prepared; a step in which preliminary learning for all the layers (42, 44, 110, 48, 50, 52, 54) in a DNN (80) is performed while switching and selecting the utterance data (90-98), and dynamically substituting a specific layer (110) with the hidden layer modules (112-120) corresponding to the selected utterance data; a step in which the DNN specific layer (110), after the preliminary learning has been completed therefor, is substituted with an initial hidden layer; and a step in which parameters for layers other than the initial hidden layer are fixed, and speech data of a specific speaker is used to perform DNN learning.

Description

technical field [0001] The present invention relates to a deep neural network (hereinafter, abbreviated as "DNN" for simplification of description) used in recognition technologies such as voice recognition, and particularly relates to a technique for improving the learning efficiency of a DNN for specific objects. Background technique [0002] As a method of machine learning, DNN has attracted much attention. DNN is suitable for, for example, image recognition and voice recognition, and it is reported that the relative error rate is reduced by 20 to 30% compared to the conventional one, and can exhibit excellent performance (Non-Patent Documents 1 to 3). [0003] The so-called DNN refers to a neural network with more layers than before. Specifically, DNN includes: an input layer, an output layer, and multiple hidden layers arranged between the input layer and the output layer. The input layer has a plurality of input nodes (neurons: neurons). The output layer has only th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/07G06N3/00G06N3/08G10L15/16
CPCG06N3/082G10L15/16G06N3/045G06N3/08G06N3/04G10L15/063G10L15/075
Inventor 松田繁树卢绪刚
Owner NAT INST OF INFORMATION & COMM TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products