Voice HNR automatic analytical method

An analysis method and technology of harmonic-to-noise ratio, applied in speech analysis, instruments, etc., can solve the problems of unreasonable scoring and matching, difficulty in estimating the period, and difficulty in reproducing the calculation process, so as to improve the sample utilization rate, eliminate manual intervention, The effect of eliminating distracting factors

Active Publication Date: 2009-06-10
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF0 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

There are some defects in this method: 1. For some voice samples with more serious pathological changes or hoarse voice samples, the estimation of the fundamental frequency period is prone to deviation, and sometimes it is even difficult to estimate the period of the sample, so that It is impossible to calculate the effective harmonic-to-noise ratio value; 2. These harmonic calculations are carried out in the ordinary time-frequency domain, which is very different from the real human ear perception, so in practical applications, the final calculation results Unreasonable match with subjective assessment scores by voice experts
This method adds manual intervention, which is not only time-consuming and laborious, but also has a subjective impact on the results, making it difficult to reproduce the calculation process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice HNR automatic analytical method
  • Voice HNR automatic analytical method
  • Voice HNR automatic analytical method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0034] The overall process of this embodiment can refer to image 3 , the technical details involved in this embodiment are described below.

[0035] 1. Segment the data and automatically find the stable part of each pronunciation, including:

[0036] 1) Use energy-based voice activity detection technology (VAD) to find out the starting and ending points of multiple segments of speech, and analyze each segment of speech separately;

[0037] 2) Framing, for each frame, calculate the average value of the energy of the frame and the entire energy, let the signal be S, divide N frames in total, each frame has m samples, then the energy E(n) of the nth frame for

[0038] E ( n ) = Σ i = 1 m ( S ( ( n ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an automatic voice harmonic-to-noise ratio analysis method, which comprises: 1) an effective phonetic segment for harmonic-to-noise ratio analysis is segmented from the recording; 2) the phonetic segment is subject to filtering processing based on an auditory model, and then a two-dimensional energy correlation coefficient of a time domain and a frequency domain in various filter channels in the auditory model is calculated; and 3) the threshold value of the correlation coefficient is preset, a coordinate point of the time domain and the frequency domain corresponding to the correlation coefficient has a harmonic component when the correlation coefficient obtained in step 2) is larger than the threshold, or else the coordinate point of the time domain and the frequency domain corresponding to the correlation coefficient has a noise component, and finally the ratio of the harmonic component and the noise component is calculated to obtain the harmonic-to-noise ratio. The method uses the correlation between the time domain characterized by an autocorrelogram and a cochlear spectral domain channel to judge the harmonic component, is not affected by the detection position of a fundamental frequency, and can detect the harmonic component more accurately and more robustly. As the cochlear spectrum is used, the method is more matched with the actual hearing of human ears.

Description

technical field [0001] The invention belongs to the technical field of speech signal processing, in particular, the invention relates to a harmonic-to-noise ratio analysis method in automatic voice evaluation. Background technique [0002] Harmonic to Noise Ration (HNR) is the main indicator for analyzing and evaluating long vowels. The traditional calculation method of harmonic-to-noise ratio is to use the autocorrelation of the periodic signal in the time domain to estimate the period of the fundamental frequency of the signal. Then, according to the position of the fundamental frequency cycle, the part with strong correlation near the fundamental frequency cycle is regarded as the harmonic component, and the part with weak or irrelevant correlation is regarded as noise to calculate the harmonic noise ratio. There are some defects in this method: 1. For some voice samples with more serious pathological changes or hoarse voice samples, the estimation of the fundamental fre...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L11/00G10L25/48
Inventor 颜永红王迪张建平
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products