Psychological acoustic model-based voice post-perception filter

A filter technology based on a psychoacoustic model, applicable to speech analysis, instruments, etc., that addresses problems such as residual noise in enhanced speech and poor auditory perception.

Inactive Publication Date: 2014-05-28
TAIYUAN UNIV OF TECH

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problem of residual noise in the enhanced speech, resulting in poor auditory perception, the pre


Examples


Embodiment Construction

[0020] The invention is described more fully below with reference to the accompanying drawings, in which various embodiments are shown. The invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.

[0021] Exemplary embodiments of the present invention are described in more detail below with reference to the accompanying drawings.

[0022] MATLAB is used to simulate speech enhanced by Spectral Subtraction (SS) and Wiener Filtering (WF) combined with the post-perceptual filter. The speech is an English male voice from the 863 speech library: "The birch canoe slid on the smooth planks." The sampling rate is 8 kHz, the frame length K is 160, and the frame overlap is 50%; the noise is Gaussian white noise an...
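The framing parameters stated above (8 kHz sampling rate, frame length K = 160, 50% overlap) can be illustrated with a minimal Python sketch. This is not the MATLAB simulation from the patent. The sine tone standing in for the 863-library utterance, the 5 dB input SNR, and the helper names `split_frames` and `add_white_noise` are all assumptions made for illustration, since the source text is truncated before it gives the noise levels.

```python
import math
import random

FS = 8000              # sampling rate in Hz (from the text)
FRAME_LEN = 160        # frame length K (from the text), i.e. 20 ms at 8 kHz
HOP = FRAME_LEN // 2   # 50% overlap means each frame starts 80 samples later


def split_frames(signal, frame_len=FRAME_LEN, hop=HOP):
    """Split a signal into full-length overlapping frames.

    Trailing samples that do not fill a whole frame are dropped.
    """
    if len(signal) < frame_len:
        return []
    n_frames = 1 + (len(signal) - frame_len) // hop
    return [signal[i * hop:i * hop + frame_len] for i in range(n_frames)]


def add_white_noise(signal, snr_db, seed=0):
    """Add white Gaussian noise scaled to a target SNR in dB.

    The SNR value is an assumed parameter; the source does not state it here.
    """
    rng = random.Random(seed)
    signal_power = sum(x * x for x in signal) / len(signal)
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [x + rng.gauss(0.0, sigma) for x in signal]


# Hypothetical stand-in for the speech sample: one second of a 440 Hz tone.
clean = [math.sin(2 * math.pi * 440 * n / FS) for n in range(FS)]
noisy = add_white_noise(clean, snr_db=5.0)
frames = split_frames(noisy)

print(len(frames), len(frames[0]))
```

With one second of audio this gives 99 frames of 160 samples, and the second half of each frame equals the first half of the next, which is what 50% overlap means in practice.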


Abstract

The invention relates to a speech post-perceptual filter based on a psychoacoustic model. The perceptual filter does not need to be fused into each enhancement algorithm, so the complexity of those algorithms is unaffected, yet the same auditory perception enhancement can be obtained. Because the filter focuses on post-processing of the enhanced speech, the auditory perception of the enhanced speech is further improved; even when noise remains and the signal-to-noise ratio is not improved, the post-perceptual filter can still improve auditory perception. The filter is derived by minimizing speech signal distortion subject to the condition that the residual noise is inaudible to the human ear. The filter gain is obtained by constructing a cost function containing the masking threshold under this condition, and is further optimized by a perceptual normalization factor constructed from the masking threshold. Excessive signal attenuation is thereby avoided, and minimal perceptual distortion of the enhanced speech is ensured.
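The constraint described in the abstract (keep speech distortion minimal while making residual noise inaudible) can be sketched in a simplified form. The code below is not the patent's gain formula: the cost function and the perceptual normalization factor are not reproduced in this excerpt. It only assumes a per-frequency-bin residual noise power `noise_psd` and masking threshold `mask_threshold` (both hypothetical inputs) and picks the largest gain G in [0, 1] for which the attenuated noise power G² · noise_psd stays at or below the threshold.

```python
import math


def post_filter_gain(noise_psd, mask_threshold):
    """Largest gain G in [0, 1] with G**2 * noise_psd <= mask_threshold.

    A simplified illustration only; inputs are assumed non-negative and
    this is not the gain derived in the patent.
    """
    if noise_psd <= mask_threshold:
        # Residual noise is already masked: leave the bin untouched.
        return 1.0
    return math.sqrt(mask_threshold / noise_psd)


# Hypothetical per-bin values, chosen only to show both branches.
noise = [0.5, 4.0, 9.0]
thresholds = [1.0, 1.0, 1.0]
gains = [post_filter_gain(n, t) for n, t in zip(noise, thresholds)]

print(gains)
```

Choosing the largest admissible gain attenuates the signal only as much as the masking threshold requires, which matches the abstract's stated aim of avoiding excessive signal weakening.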

Description

Technical field

[0001] The invention relates to a speech post-perceptual filter based on a psychoacoustic model.

Background technique

[0002] At present, various speech enhancement algorithms can remove noise to varying degrees, but some residual noise and musical noise remain and degrade speech quality, so they need to be further eliminated. In addition, the final evaluation of speech depends on human auditory perception. Research on speech enhancement should therefore make use of the human auditory system's perceptual characteristics, namely the masking effect of the human ear, which has a special suppression function for unwanted noise, so that the enhanced speech reduces hearing fatigue as much as possible, improves hearing sensitivity, and improves voice quality. Incorporating the masking effect of human hearing therefore plays a very important role in the performance of speech enhancement.

[0003] In recent years, man...


Application Information

IPC(8): G10L21/0208
Inventors: 贾海蓉, 李鸿燕, 武奕峰, 张雪英
Owner: TAIYUAN UNIV OF TECH