Disguised voice detection method based on complete and local binary patterns

A local binary, speech detection technology, applied in speech analysis, speech recognition, character and pattern recognition, etc., can solve the problem of unsatisfactory detection effect, and achieve the effect of clear texture, strong generalization ability and good effect.

Inactive Publication Date: 2019-08-20
HANGZHOU DIANZI UNIV
View PDF4 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In practice, because the speaker recognition system has to deal with various unknown disguised voice attacks, the detection effect based on the above characteristics is often not very ideal.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Disguised voice detection method based on complete and local binary patterns
  • Disguised voice detection method based on complete and local binary patterns
  • Disguised voice detection method based on complete and local binary patterns

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] In order to illustrate the embodiments of the present invention more clearly, the specific implementation manners of the present invention will be described below with reference to the accompanying drawings. Obviously, the accompanying drawings in the following description are only some embodiments of the present invention, and those skilled in the art can obtain other accompanying drawings based on these drawings and obtain other implementations.

[0059] The method for detecting fake speech based on the complete local binary pattern in the embodiment of the present invention utilizes the complete local binary pattern (Completed Local Binary Pattern, CLBP) to extract the texture features of the spectrogram of the speaker's real speech signal and the fake speech signal and use it to train Support Vector Machines for Pseudo-Speech Classification Functions Can Efficiently Implement Anti-Masquerade Detection.

[0060] In order to extract the texture features of speech, it...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a disguised voice detection method based on complete and local binary patterns. The disguised voice detection method comprises the steps of carrying out variable Q conversionon all voices in a real voice library and a corresponding disguised voice library, so as to obtain voice spectrograms of all true voices and disguised voices; converting the voice spectrograms into corresponding grayscale images, and generating corresponding texture characteristics by processing the grayscale images through the complete and local binary patterns; obtaining a support vector machinethrough training by using all the texture characteristics as a training set for training the support vector machine; and inputting a to-be-recognized voice to the support vector machine so as to recognize the disguised voice. The voice spectrograms obtained through variable Q conversion are clearer in texture, and beneficial to extracting the texture characteristics of voice signals; moreover, the complete and local binary patterns are adopted, local symbol difference value information and local amplitude difference value information of the voice spectrograms are included, the texture characteristics of the signals can be obtained more comprehensively, the classification by the support vector machine is facilitated, and the accuracy of recognizing the disguised voice is improved.

Description

technical field [0001] The invention belongs to the technical field of speech recognition, and in particular relates to a method for detecting a fake speech based on a complete partial binary pattern. Background technique [0002] Masquerade voice detection is to analyze the speaker's voice, and then identify whether it is the real speaker's voice or the artificially maliciously disguised voice. Masquerading voice is usually generated by device playback, voice conversion and speech synthesis technology. Through these deliberate operations, it can be disguised as a specific speaker's voice, so as to deceive the speaker recognition system. The masquerade speech recognition system can realize anti-masquerade detection for malicious masquerade speech, improve the security performance of the speaker recognition system, and has broad application prospects. Masquerade speech recognition usually needs to extract the features of the target speech signal, and then compare and analyze...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/18G10L25/45G10L17/04G10L15/08G06K9/62G06K9/00
CPCG10L25/18G10L25/45G10L17/04G10L15/08G06F2218/08G06F2218/12G06F18/2411G06F18/214
Inventor 简志华徐剑郭珊金易帆
Owner HANGZHOU DIANZI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products