Auditory perception characteristic-based speech quality objective evaluating method

An objective voice quality evaluation technology, applied in voice analysis, instruments, etc. It addresses the high computational complexity that hinders real-time evaluation of voice quality, and achieves the effects of facilitating performance analysis, improving correlation, and avoiding energy leakage.

Active Publication Date: 2015-04-01
HUNAN INST OF METROLOGY & TEST +1

AI Technical Summary

Problems solved by technology

[0005] ITU-T Recommendation P.862, Perceptual Evaluation of Speech Quality (PESQ), released in 2001, is currently a high-performance objective method for evaluating voice quality that handles communication delay, environmental noise, and errors well. However, it is based on a perceptual model of the Bark spectrum, which has high computational complexity and is not conducive to real-time evaluation of voice quality.

Embodiment Construction

[0044] 1. Gammatone filter

[0045] The Gammatone filter is a standard cochlear auditory filter, and the time-domain impulse response of the filter is:

[0046] $g(t) = B^{n} t^{n-1} e^{-2\pi B t} \cos(2\pi f_0 t + \varphi)\, u(t)$ (1)

[0047] Here u(t) is the unit step function (u(t) = 0 when t < 0 and u(t) = 1 when t > 0). The parameter B = b_1 ERB(f_0), where ERB(f_0) is the equivalent rectangular bandwidth (ERB) of the Gammatone filter, that is, the width of a rectangular filter that passes the same energy as the specified filter for the same white-noise input. It is related to the Gammatone filter center frequency f_0 by ERB(f_0) = 24.7 + 0.108 f_0. The parameter b_1 = 1.019 is introduced to make the function more consistent with physiological data. n is the order of the filter; research shows that a Gammatone filter with n = 4 simulates the filtering characteristics of the basilar membrane well. The parameter φ is the initial phase of the filter.
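As a minimal sketch of equation (1), the following Python code samples the impulse response of one Gammatone filter using the parameter values given above (n = 4, b_1 = 1.019, ERB(f_0) = 24.7 + 0.108 f_0). The function names, the sampling rate argument, and the duration argument are assumptions made for illustration and do not come from the patent text. The response is not amplitude-normalized.

```python
import math


def erb(f0_hz):
    """Equivalent rectangular bandwidth in Hz: ERB(f0) = 24.7 + 0.108 * f0."""
    return 24.7 + 0.108 * f0_hz


def gammatone_impulse_response(f0_hz, fs_hz, duration_s, n=4, b1=1.019, phi=0.0):
    """Sample g(t) = B^n t^(n-1) exp(-2*pi*B*t) cos(2*pi*f0*t + phi) u(t).

    fs_hz and duration_s are illustrative assumptions (not from the source).
    Sampling starts at t = 0, so the unit step u(t) equals 1 for every
    sample after the first, and the first sample is 0 for n > 1.
    """
    bandwidth = b1 * erb(f0_hz)  # B = b1 * ERB(f0)
    num_samples = int(round(duration_s * fs_hz))
    response = []
    for k in range(num_samples):
        t = k / fs_hz
        envelope = bandwidth ** n * t ** (n - 1) * math.exp(-2.0 * math.pi * bandwidth * t)
        carrier = math.cos(2.0 * math.pi * f0_hz * t + phi)
        response.append(envelope * carrier)
    return response


if __name__ == "__main__":
    g = gammatone_impulse_response(f0_hz=1000.0, fs_hz=16000.0, duration_s=0.025)
    peak_index = max(range(len(g)), key=lambda i: abs(g[i]))
    print(len(g), peak_index)
```

A filter bank would repeat this for a set of center frequencies f_0. How those center frequencies are spaced is not stated in the visible text, so it is left out of the sketch.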

[0048] The frequency response charac...

Abstract

The invention discloses a method for objectively evaluating speech quality based on auditory perception characteristics. The method is characterized in that Gammatone filter bank filtering is added to the module that maps the frequency spectrum to the Bark spectrum. The method comprises the following steps: (1) processing a reference signal and a degraded signal with POLQA (Perceptual Objective Listening Quality Analysis) and feeding them into the core model; (2) in the module of the core model that maps the frequency spectrum to the Bark spectrum, adding Gammatone filter bank filtering and then performing acoustic conversion, so that the extracted auditory spectrum more closely approximates the auditory perception of the human ear; (3) after the acoustic conversion, performing interference analysis to determine the distortion of the degraded signal relative to the reference signal and obtain an objective MOS score. Compared with other methods, the method effectively improves the correlation between objective and subjective evaluation results.
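The abstract does not say how the correlation between objective and subjective results is measured. One common choice, assumed here for illustration only, is the Pearson correlation coefficient between objective MOS estimates and subjective MOS ratings for the same speech samples. The function name and argument names below are hypothetical.

```python
import math


def pearson_correlation(objective_scores, subjective_scores):
    """Pearson correlation coefficient between two equal-length score lists.

    Assumption: this is only one possible correlation measure; the source
    does not specify which one the inventors used.
    """
    if len(objective_scores) != len(subjective_scores) or len(objective_scores) < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    mean_obj = sum(objective_scores) / len(objective_scores)
    mean_sub = sum(subjective_scores) / len(subjective_scores)
    covariance = sum(
        (o - mean_obj) * (s - mean_sub)
        for o, s in zip(objective_scores, subjective_scores)
    )
    var_obj = sum((o - mean_obj) ** 2 for o in objective_scores)
    var_sub = sum((s - mean_sub) ** 2 for s in subjective_scores)
    if var_obj == 0.0 or var_sub == 0.0:
        raise ValueError("correlation is undefined for constant scores")
    return covariance / math.sqrt(var_obj * var_sub)
```

A value close to 1 would indicate that the objective scores track the subjective ratings closely.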

Description

Technical field

[0001] The invention relates to the technical field of speech signal processing, in particular to a method for objectively evaluating speech quality based on auditory perception characteristics.

Background technique

[0002] Voice quality evaluation can be divided into two categories according to the evaluating subject: subjective evaluation and objective evaluation.

[0003] Subjective evaluation uses people as the evaluators of speech quality. Although this method is relatively complicated, people are the final recipients of speech, so this evaluation is a true reflection of speech quality. The Mean Opinion Score (MOS), proposed by the ITU in 1996, is a widely used subjective evaluation method that uses the average opinion score of testers to directly reflect people's perception of voice quality. The advantage of subjective evaluation is that it conforms to people's feelings about voice quality, but the disadvantages are that it i...

Application Information

IPC(8): G10L25/60
Inventor 李庆生, 刘良江, 卞昕, 柏文琦, 周鑫, 彭正梁, 徐昱
Owner HUNAN INST OF METROLOGY & TEST