A real-time decomposition/synthesis method of digital speech based on auditory perception characteristics

A technology of digital speech and synthesis method, applied in speech synthesis, speech analysis, speech recognition, etc., can solve the lack of theoretical calculation basis, reduce the operability and repeatability of the method, and do not give the implementation method of the gamma pass filter and other problems to achieve the effect of enhancing operability and solving real-time problems

Active Publication Date: 2020-06-05
TSINGHUA UNIV
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0037] 1. This method limits the order of the gamma-pass function to N=4, which is only a special case of the gamma-pass filter, and does not give the implementation method of the gamma-pass filter when N is other values
[0038] 2. Some key parameters of this method are obtained through simulation and lack theoretical calculation basis, mainly including parameter b and normalization parameter g n and channel group delay D m , which reduces method operability and reproducibility
This eventually leads to the suppression of speech at some frequencies of the synthesized speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A real-time decomposition/synthesis method of digital speech based on auditory perception characteristics
  • A real-time decomposition/synthesis method of digital speech based on auditory perception characteristics
  • A real-time decomposition/synthesis method of digital speech based on auditory perception characteristics

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0098] A kind of real-time decomposition / synthesis method of digital speech based on auditory perception characteristics proposed by the present invention is further described below in conjunction with the accompanying drawings and specific embodiments as follows:

[0099] The main difference between this method and the prior art is that a group of gamma-pass filters are used to simulate the basilar membrane of the human ear, and the filtering characteristics of each position on the basilar membrane can be described by a gamma-pass filter. The method refers to the delay characteristics of the human ear basilar membrane and the equal loudness curve characteristics, and then realizes the decomposition and synthesis of speech.

[0100] The concrete steps of this method are as follows:

[0101] 1) Construct a gamma-pass digital filter model of any order (including bandwidth, center frequency, and position parameter information of each filter):

[0102] Assuming that the number of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a digital speech real-time decomposition / synthesis method based on auditory perception characteristics, and relates to the field of voice signal processing. The method comprises the following steps: forming an N-order Gammatone filter through N-stage-cascaded second-order band-pass filters, and then, constructing an arbitrary-order Gammatone digital filter model and parameters thereof; in the speech decomposition stage, decomposing an input speech into M paths of signals by adopting a floating-point algorithm or a fixed-point algorithm and through M paths of Gammatone filters; and in the speech synthesis stage, introducing time delay in a Gammatone filterbank to accord with characteristics of the human ear better, human ear basilar membrane time delay being inversely proportional to frequency, and finally, carrying out speech synthesis operation. Through reference to equiloudness curve characteristics of the human ear, the speech decomposition / synthesis method is improved, and thus the final speech synthesis effect is allowed to be close to the effect of an ideal band-pass filter. The method can be applied to speech equipment of a mobile phone, an artificial cochlea and a hearing aid and the like.

Description

technical field [0001] The invention belongs to the field of digital voice signal processing, and in particular relates to a real-time decomposition / synthesis method of digital voice based on auditory perception characteristics. Background technique [0002] In daily life, there are various noises. The performance of devices such as speech enhancement and speech recognition will deteriorate significantly in noisy environments, limiting their application scenarios. Because the human ear can still work normally in a noisy environment, and has strong sensitivity and anti-interference ability to sound. Therefore, it is urgent to realize the auditory perception characteristics of the human ear, especially the basilar membrane, in the speech signal processing system. The sensory properties of the basilar membrane of the human ear are: [0003] 1. Frequency selection characteristics: Different frequencies have corresponding resonance points on the basilar membrane. Higher freque...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/24G10L21/0208G10L21/0224G10L13/00
CPCG10L13/00G10L15/24G10L21/0208G10L21/0224
Inventor 李冬梅杨有为贾瑞刘润生
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products