Psychological acoustic model-based voice post-perception filter

The psychoacoustic model and filter technology can be applied in speech analysis, instruments, and similar areas to address problems such as residual noise in speech and poor auditory perception.

Status: Inactive
Publication Date: 2014-05-28

TAIYUAN UNIV OF TECH

Cites: 4. Cited by: 5.

Problems solved by technology

[0004] To address the residual noise that remains in enhanced speech and degrades auditory perception, the present invention proposes a post-perceptual filter based on a psychoacoustic model and applies it to speech enhancement.

Embodiment Construction

[0020] Hereinafter, the invention will now be described more fully with reference to the accompanying drawings, in which various embodiments are shown. However, this invention may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.

[0021] Hereinafter, exemplary embodiments of the present invention will be described in more detail with reference to the accompanying drawings.

[0022] MATLAB is used to simulate speech enhanced by Spectral Subtraction (SS) and by Wiener Filtering (WF), each followed by the post-perceptual filter. The speech is an English male voice from the 863 speech library: "The birch canoe slid on the smooth planks." The sampling rate is 8 kHz, the frame length K is 160, and the frame overlap is 50%; the noise is Gaussian white noise an...
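The framing parameters in [0022] can be illustrated with a minimal Python sketch. The patent's simulation was done in MATLAB and its code is not reproduced in this excerpt, so everything below is an assumption made for illustration: the helper names `frame_signal` and `add_white_noise` are hypothetical, no analysis window is applied because the excerpt does not name one, the 5 dB SNR is an arbitrary choice, and a synthetic tone stands in for the 863 speech library recording.

```python
import numpy as np

FS = 8000      # sampling rate from the text (Hz)
K = 160        # frame length from the text (samples, i.e. 20 ms at 8 kHz)
HOP = K // 2   # 50% overlap gives a hop of 80 samples

def frame_signal(x, frame_len=K, hop=HOP):
    """Split a 1-D signal into overlapping frames (no window applied)."""
    x = np.asarray(x, dtype=float)
    if len(x) < frame_len:
        raise ValueError("signal shorter than one frame")
    n_frames = 1 + (len(x) - frame_len) // hop
    # Row i holds samples [i*hop, i*hop + frame_len).
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def add_white_noise(x, snr_db, rng=None):
    """Add Gaussian white noise scaled to a target SNR in dB (hypothetical helper)."""
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x, dtype=float)
    noise = rng.standard_normal(len(x))
    # Scale the noise so that signal power / noise power = 10^(snr_db / 10).
    scale = np.sqrt(np.mean(x**2) / (np.mean(noise**2) * 10 ** (snr_db / 10)))
    return x + scale * noise

# One second of a synthetic tone stands in for the speech recording (assumption).
t = np.arange(FS) / FS
clean = np.sin(2 * np.pi * 440 * t)
noisy = add_white_noise(clean, snr_db=5, rng=np.random.default_rng(0))
frames = frame_signal(noisy)
```

With these parameters each frame spans 20 ms and consecutive frames share 80 samples; an enhancement stage such as SS or WF would then operate frame by frame before any post-filter is applied.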

Abstract

The invention relates to a voice post-perception filter based on a psychoacoustic model. The perception filter does not need to be fused into each enhancement algorithm, so the complexity of those algorithms is unaffected, yet the same auditory perception enhancement is obtained. Because the filter focuses on post-processing of the enhanced voice, the auditory perception of the enhanced voice is further improved; even when noise remains and the signal-to-noise ratio is not improved, the post-perception filter can still improve auditory perception. The filter is designed to minimize voice signal distortion on the condition that the residual noise is inaudible to the human ear. Under this condition, the gain of the filter is obtained by constructing a cost function containing a masking threshold, and it is further optimized by a perceptual normalization factor constructed from the masking threshold. As a result, excessive signal weakening is avoided and minimum perceptual distortion of the enhanced voice is ensured.
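One way to read the design condition in the abstract (minimum speech distortion, subject to the residual noise staying inaudible) is as a constrained gain choice per frequency bin. The sketch below is an illustration under stated assumptions, not the patent's actual cost function or perceptual normalization factor, neither of which appears in this excerpt. It picks the gain closest to 1 for which the attenuated residual-noise power does not exceed a given masking threshold. The names `post_filter_gain`, `noise_psd`, and `mask_thresh` are hypothetical, and the masking threshold is supplied as an input rather than computed from a psychoacoustic model.

```python
import numpy as np

def post_filter_gain(noise_psd, mask_thresh, eps=1e-12):
    """Per-bin gain G in (0, 1] satisfying G^2 * noise_psd <= mask_thresh.

    Assumed reading of the abstract: keep G as close to 1 as possible
    (least speech distortion) while the residual noise stays at or below
    the masking threshold (inaudible). Not the patent's own formula.
    """
    noise_psd = np.asarray(noise_psd, dtype=float)
    mask_thresh = np.asarray(mask_thresh, dtype=float)
    # Largest gain that pushes the noise down to the threshold, capped at 1.
    return np.minimum(1.0, np.sqrt(mask_thresh / np.maximum(noise_psd, eps)))

# Toy values: residual-noise power and masking threshold for three bins.
noise_psd = np.array([4.0, 1.0, 0.25])
mask_thresh = np.array([1.0, 1.0, 1.0])
gain = post_filter_gain(noise_psd, mask_thresh)
# gain is [0.5, 1.0, 1.0]: only the first bin, where noise exceeds the threshold, is attenuated.
```

Bins whose residual noise already lies below the threshold are left untouched, which is one sense in which excessive signal weakening can be avoided.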

Description

Technical field

[0001] The invention relates to a speech post-perceptual filter based on a psychoacoustic model.

Background technique

[0002] At present, various speech enhancement algorithms can remove noise to varying degrees, but they all leave some residual noise and musical noise, which affect speech quality and need to be further eliminated. In addition, the final evaluation of speech depends on human auditory perception. Therefore, research on speech enhancement should make use of the human auditory system's perceptual characteristics, namely the masking effect of the human ear, which suppresses unwanted noise, so that the enhanced speech reduces hearing fatigue as much as possible, improves hearing sensitivity, and improves voice quality. Exploiting the masking effect in the auditory characteristics of the human ear therefore plays a very important role in speech enhancement performance.

[0003] In recent years, man...

Application Information

IPC(8): G10L21/0208

Inventor: 贾海蓉, 李鸿燕, 武奕峰, 张雪英

Owner: TAIYUAN UNIV OF TECH
