Small data speech acoustic modeling method in speech recognition

A modeling method and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of low accuracy of acoustic model recognition and achieve the effect of enriching knowledge

Active Publication Date: 2018-10-19
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF6 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In order to solve the above-mentioned problems in the prior art, that is, in order to solve the problem of low recognition accuracy of the acoustic model of th

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Small data speech acoustic modeling method in speech recognition
  • Small data speech acoustic modeling method in speech recognition
  • Small data speech acoustic modeling method in speech recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] Preferred embodiments of the present invention are described below with reference to the accompanying drawings. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the protection scope of the present invention.

[0040] Based on the problem of low speech recognition performance of the existing target language pointed out by the background technology, the present invention provides a small data speech acoustic modeling method in speech recognition, aiming at improving the recognition accuracy of the target language. data voice.

[0041] refer to Figure 1-5 , figure 1 It is a flow chart of the main steps of the small data speech acoustic modeling method in the speech recognition of the present invention; figure 2It is a flow chart of specific steps of using multilingual confrontation training technology to train the bottleneck network model of multi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of signal processing in the electronic industry, and aims at solving a problem that the discrimination performance of an acoustic model of a target language with just a little mark data is low. In order to solve the above problem, the invention provides a small data speech acoustic modeling method in speech recognition, and the method comprises the steps: carrying out the adversarial training of the acoustic features of many languages through a language adversarial discriminator, so as to build a multi-language adversarial bottleneck network model;taking the acoustic features of a target language as the input of the multi-language adversarial bottleneck network model, so as to extract the bottleneck features which is irrelevant to the language;carrying out the fusion of the bottleneck features which is irrelevant to the language with the acoustic features of the target language, so as to obtain fusion features; carrying out the training through the fusion features, so as to build an acoustic model of the target language. The method effectively irons out the defects, caused by a condition that the bottleneck information comprises the information correlated with the language, of the unremarkable improvement of the recognition performance of the target language and even the negative migration phenomenon in the prior art, thereby improving the voice recognition precision of the target language.

Description

technical field [0001] The invention relates to the technical field of signal processing in the electronics industry, in particular to a small data speech acoustic modeling method in speech recognition. Background technique [0002] Voice interaction is the most natural way of human-computer interaction, and voice recognition is the most important technology in voice interaction. In recent years, with the in-depth application of deep learning technology in speech recognition, speech recognition technology has made major breakthroughs. [0003] As we all know, deep learning requires a large amount of labeled data. For Mandarin or English, these labeled data are easy to obtain. However, for dialects, it is difficult to obtain a large amount of labeled data, and only a small amount of labeled data can be collected. Therefore, for establishing an acoustic model of a target language (such as a dialect) with a small amount of labeled data, the effect of using conventional deep le...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/06G10L15/02
CPCG10L15/02G10L15/063
Inventor 陶建华易江燕温正棋
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products