Method of recognizing ear speech in normal speech flow under condition of small database

A recognition method and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of small sample data and difficulty in improving the recognition rate, and achieve the effect of improving the recognition rate, ensuring the recognition rate, and improving the recognition performance

Active Publication Date: 2017-01-11
SUZHOU UNIV
View PDF7 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a method for recognizing middle ear speech in normal speech streams under the condition of a small database, so as to solve the difficulty in improving the recognition rate due to the small amount of sample data when identifying isolated words in ear speech in normal speech streams. The problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method of recognizing ear speech in normal speech flow under condition of small database
  • Method of recognizing ear speech in normal speech flow under condition of small database
  • Method of recognizing ear speech in normal speech flow under condition of small database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] Embodiment one: see attached figure 2 Shown, the recognition method of middle ear speech of normal speech flow under a kind of small database condition, comprises the following steps:

[0029] (1) Construct speech recognition classification system, described speech recognition classification system comprises:

[0030] Digital voice input module, used for sampling or reading the voice stream signal containing ear voice;

[0031] The feature extraction module is used to extract spectral features; the selected spectral features include 12-order MFCC, logarithmic energy, 0-order cepstral coefficient, first-order derivative, and second-order derivative, with a frame length of 25 milliseconds and a frame shift of 10 milliseconds.

[0032] Deep Neural Networks, see attached image 3 As shown, it consists of a deep belief network and a Softmax output layer;

[0033] The deep belief network is composed of a plurality of restricted Boltzmann machine stacks from bottom to top,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method of recognizing ear speech in a normal speech flow under the condition of a small database, comprising the following steps: building a speech recognition and classification system which comprises a digital speech input module, a feature extraction module, and a deep neural network composed of a deep belief network and a Softmax output layer, wherein the deep belief network is composed of restricted Boltzmann machines stacked from bottom to top, spectrum feature is the input feature of the deep belief network, and the Softmax output layer correspondingly outputs a target value of the deep neural network; acquiring training data set samples and processing the samples in at least one of the following ways: (1) artificially extending the data set 8 to 24 times by repetition; and (2) scrambling the data set, and training the speech recognition and classification system; and using the trained classification system to recognize a to-be-recognized speech flow. The recognition performance of the system can be improved in a small database, and the recognition rate of normal speech is guaranteed while ear speech recognition is realized.

Description

technical field [0001] The invention relates to a speech signal processing technology, in particular to a method for recognizing ear speech appearing in Chinese normal speech flow under the condition of low resource and small database. Background technique [0002] Otospeech is a special form of communication in which the sound is low and the vocal cords do not vibrate at all. It is significantly less perceptual and intelligible than normal speech. Nevertheless, ear speech is also a natural speech form of people's daily communication. It is often used for communication in a quiet or confidential environment, such as when the speaker does not want to disturb others, or there is some private information to be exchanged. With the development of communication technology, the human-computer interaction interface of ear voice is getting more and more attention, such as the use of handheld devices such as smart phones in company meetings or public places, and digital password s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/08G10L15/16G10L15/20
CPCG10L15/063G10L15/08G10L15/16G10L15/20
Inventor 陈雪勤刘正赵鹤鸣
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products