Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speaker recognition feature extraction method based on PSNCC (perception spectrogram Norm cochlea-filter coefficient)

A speaker recognition and filter coefficient technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as ignoring the time-frequency domain process, declining speaker recognition rate, and weak robustness

Inactive Publication Date: 2017-05-10
SUZHOU UNIV
View PDF7 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] Technical problem to be solved: In order to solve the problem that the robustness of the traditional feature parameter processing method in the frequency domain is not strong, the recognition rate of speaker recognition drops sharply when the signal-to-noise ratio decreases, and the current feature parameters pay more attention to the frequency domain processing to obtain improved robustness, ignoring that the feature extraction process is a complete time-frequency domain process, a speaker recognition feature extraction method that perceives regularized cochlear filter coefficients is proposed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speaker recognition feature extraction method based on PSNCC (perception spectrogram Norm cochlea-filter coefficient)
  • Speaker recognition feature extraction method based on PSNCC (perception spectrogram Norm cochlea-filter coefficient)
  • Speaker recognition feature extraction method based on PSNCC (perception spectrogram Norm cochlea-filter coefficient)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0176] In order to further understand the present invention, the preferred embodiments of the present invention are described below in conjunction with examples, but it should be understood that these descriptions are only to further illustrate the features and advantages of the present invention, rather than limiting the claims of the present invention.

[0177] A method for extracting speaker recognition features of perceptual spectrum regularized cochlear filter coefficients, comprising the following steps:

[0178] Step S1, based on the traveling wave of the cochlear basilar membrane and its impulse response, construct a cochlear filter in the frequency domain that conforms to the psychoacoustic experiment of the human ear, and expand it into a cochlear filter bank with a nonlinear distribution on the Bark scale;

[0179] Step S2, performing speech enhancement based on auditory perception characteristics on the speech and performing two-dimensional enhancement on the time-f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speaker recognition feature extraction method based on a PSNCC (perception spectrogram norm cochlea-filter coefficient). The speaker recognition feature extraction method comprises the following steps: first, constructing a cochlea filter group conforming to traveling wave impulse response and nolinear frequency distribution of a cochlear basilar membrane; then carrying out speech enhancement and two-dimensional enhancement based on auditory perception features on voice, and carrying out two-dimensional boundary detection on a pure voice spectrum structure in continuous distribution, thus obtaining a perception spectrogram structure norm parameter PSN; finally, further normalizing all cochlea-filter coefficients output by the cochlea filter group in the time domain through the perception spectrogram structure norm parameter PSN, and extracting the PSNCC feature parameter. According to the PSNCC feature parameter extracted by adopting the method, the robust performance of the feature parameter is improved from two aspects, namely, the time domain and the frequency domain, and further, the recognition rate of a speaker recognition system under the noise environment with low signal to noise ratio is improved.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a method for extracting speaker recognition features by perceiving spectrum regularized cochlear filter coefficients. Background technique [0002] Feature extraction is the first important component in speaker recognition. In general, successful front-end feature parameters should carry enough judgmental information for classification or recognition, be suitable for back-end modeling, and be fairly robust to changes in the acoustic environment. However, achieving satisfactory system performance under different operating modes remains problematic, especially when the auditory training and testing environments are severely mismatched. The robust performance of feature parameters in low SNR environment needs to be further improved to solve this mismatch. [0003] One of the most common features is the Linear Prediction Cepstral Coefficients (LPCC). The speech formant ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/20G10L17/20
CPCG10L15/02G10L15/20G10L17/20
Inventor 吴迪陶智赵鹤鸣肖仲喆张晓俊
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products