Hearing perception characteristic-based objective voice quality evaluation method

A speech quality and auditory perception technology, applied in speech analysis, instruments, etc., can solve the problems of not fully reflecting the auditory perception characteristics of the human ear, unfavorable real-time evaluation of speech quality, and large differences in evaluation performance, achieving complex running time and algorithms. The effect of reducing the degree of accuracy, improving the evaluation accuracy, and shortening the running time

Inactive Publication Date: 2013-01-16
CHONGQING UNIV
View PDF4 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The PESQ perceptual speech quality evaluation proposed by ITU-T recommendation P.862 is currently a high-performance objective speech quality evaluation method, which can better identify communication delays, environmental noise and errors, but it is based on the perception model of the Bark spectrum. High computational complexity, which is not conducive to real-time evaluation of voice quality
[0004] The Mel-CD distortion measurement uses MFCC as the speech feature parameter, and has low computational complexity. It is a simple and effective speech quality evaluation method, but its evaluation performance is quite different from that of PESQ.
The analysis shows that although the auditory principle of the human ear and the decorrelation characteristics of Mel cepstrum are used in the extraction process of MFCC feature parameters, it uses a triangular filter bank to simulate the frequency selection characteristics of the cochlear basilar membrane and a logarithmic operation to simulate the amplitude The value nonlinear transformation process does not fully reflect the auditory perception characteristics of the human ear

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hearing perception characteristic-based objective voice quality evaluation method
  • Hearing perception characteristic-based objective voice quality evaluation method
  • Hearing perception characteristic-based objective voice quality evaluation method

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0021] 1. The original speech and the distorted speech of the system under test are first level adjusted to equalize their intensity to an equivalent energy level; then through the ideal band-pass filter, input filtering is performed on the two signals; finally, the time delay generated by the system under test is compensated by time alignment, and the preprocessing process is completed;

[0022] 2. The preprocessed speech signal and The feature parameters are extracted separately;

[0023] attached figure 2 The specific extraction process of the speech signal feature parameters is shown:

[0024] 3. Perform FFT transformation with Hanning window on the speech signal to obtain the signal spectrum ;

[0025] 4. The pitch of the sound heard by the human ear is not linearly proportional to the frequency of the sound. The Mel frequency scale, which is more in line with the auditory characteristics of the human ear, is used for frequency division. The specific relati...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a hearing perception characteristic-based objective voice quality evaluation method which is simple and effective. An ear hearing model and non-linear compression conversion are introduced into an extraction process of MFCC (Mel frequency cepstrum coefficient) characteristic parameters according to psychoacoustics principles. According to the method, a Gammatone filter is adopted to simulate a cochlea basement membrane; and the strength-loudness perception characteristics of the voice are simulated through cube root non-linear compression conversion in an amplitude non-linear conversion process. By using new characteristic parameters, a voice quality evaluation method which is more accordant with the ear hearing perception characteristics is provided. Compared with other methods, the relevancy between objective evaluation results and subjective evaluation results is effective improved, the operation time is shorter and the complexity is lower, and the method has stronger adaptability, reliability and practicability. A new solution to improve the objective voice quality evaluation can be provided through the method for voice quality evaluation by simulating the hearing perception characteristics of human ears.

Description

technical field [0001] The present invention relates to an objective speech quality evaluation technology based on the auditory perception characteristics of the human ear, and more specifically, to a method of introducing the auditory model of the human ear into the extraction process of MFCC characteristic parameters, and realizing speech quality by calculating the degree of distortion of the characteristic parameters. Methods for objective quality assessment. Background technique [0002] Voice quality evaluation is one of the basic standards to measure the performance of voice communication system. From the perspective of the evaluation subject, it can be divided into two categories: subjective evaluation and objective evaluation. The MOS (Mean Opinion Score) method proposed by ITU-T Recommendation P.830 is a widely used subjective evaluation method, which uses the average opinion score of testers to intuitively reflect people’s perception of voice quality, but such ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L19/00
Inventor 谭晓衡秦基伟周帅裴婧黄振林唐永刚马旭东
Owner CHONGQING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products