Acoustic feature extraction method for playback attack detection in vocal print recognition

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of playback attack and voiceprint recognition, which is applied in the field of acoustic feature extraction for recording playback attack detection in voiceprint recognition, can solve problems such as insufficiency, and achieve the effects of improving performance, shortening time, and strengthening channel differences.

Active Publication Date: 2019-10-01

上海企创信息科技有限公司

View PDF5 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

In fact, compared with the original voice, the recording and playback attack voice has two additional processes of recording and playback, and the frequency response characteristics of the recording device and the playback device are non-uniform, so that the frequency spectrum will be different in the low frequency band and the high frequency band. Therefore, only emphasizing the spectral information in the low frequency band is not sufficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0055] Such as figure 1 As shown, it is an acoustic feature extraction method for recording playback attack detection in voiceprint recognition in an embodiment of the present invention, and the method includes the following steps:

[0056] S10. Preprocessing the input voice;

[0057] The preprocessing in step S10 includes framing, windowing and denoising, and step S10 specifically includes the following steps:

[0058] S11, the input voice signal x (n) is divided into frames, the voice signal is divided into a plurality of voice frames with a frame length of N (actually optional 1024), there is overlap between adjacent two frames, and the frame shift is L (actual optional256);

[0059] S12, adding a window to each frame of voice signal x(i, n) after framing, multiplying each frame of voice signal by a Hamming window with a window length of N, to obtain a windowed voice frame Calculated as follows:

[0060]

[0061] S13. Calculate the short-term energy SE(i) of each fr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an acoustic feature extraction method for playback attack detection in vocal print recognition. The method includes the steps that input voice is preprocessed; Fourier transform is conducted on each frame of preprocessed voice signals, a time-domain signal is converted into a frequency-domain signal, and spectra of the voice signals are obtained; spectral line energy of each frame of the voice signals after Fourier transform is calculated; asymmetric hyperbolic sinusoidal frequency scaling filtering is conducted on each frame of the voice signals according to the spectral line energy; logarithmic transformation is conducted on each frame of the filtered voice signals to obtain logarithmic energy spectra of frames of the voice signals; and discrete cosine transform is conducted on the logarithmic energy spectra of frames of the voice signals to obtain hyperbolic sinusoidal cepstrum coefficients of frames of the voice signals. According to the acoustic feature extraction method for playback attack detection in vocal print recognition, through asymmetric hyperbolic sinusoidal frequency scaling, the effective utilization method of voice frequency spectrum information is specified, the information channel difference of original voice and playback attack voice is strengthened, and the performance of playback attack detection can be improved.

Description

technical field [0001] The invention relates to the technical field of acoustic signal processing, in particular to an acoustic feature extraction method for recording playback attack detection in voiceprint recognition. Background technique [0002] Voiceprint recognition is an identification technology based on biometrics, which can identify the speaker's identity through the speaker's voice characteristics. Another widely used biometric identification technology. However, the security application of the voiceprint recognition system must solve the problem of spoofing attacks, including speech synthesis spoofing attacks and recording playback spoofing attacks. Due to the high similarity between the recorded and played back voice and the original voice, the biggest challenge is the recording and playback attack. [0003] Early recording and playback attack detection Due to the lack of public large corpus databases and baseline systems, it is difficult for developers to car...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L17/02G10L17/22G10L19/02G10L19/26G10L25/24

CPCG10L17/02G10L17/22G10L19/02G10L19/26G10L25/24

Inventor 俞一彪郭星辰

Owner 上海企创信息科技有限公司

Acoustic feature extraction method for playback attack detection in vocal print recognition

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology