Voice playback detection method and device

A voice playback detection method, applied in voice analysis, voice recognition, instruments, etc., that addresses the problem of voice replay and helps avoid replay attacks.

Active Publication Date: 2016-06-22

TSINGHUA UNIV +1

16 Cites, 43 Cited by

AI Technical Summary

Problems solved by technology

[0005] This application provides a recording playback detection method and device to solve the problem of voice playback in speaker recognition technology.

Examples

Embodiment 1

[0051] Refer to Figure 2, which shows a flow chart of the voice playback detection method described in Embodiment 1 of the present application. The method specifically includes:

[0052] Step 201: Establish a user channel model according to the reserved training voice of the target user.

[0053] The reserved training voice of the target user is acquired in advance, and a user channel model is established according to the acquired reserved training voice of the target user.

[0054] The reserved training voice can be obtained from the background server or the client of the target user, or can be obtained in other ways, which is not specifically limited in this application.

[0055] Step 202: Calculate the trust score of the speech to be recognized on the user channel model.

[0056] This application uses the user channel model to score the trust degree of the speech to be recognized input by the user terminal, obtain the trust degree score of the speech to be recognized, and judge whe...
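
The flow of Steps 201 and 202 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the excerpt does not specify the form of the user channel model, so a per-dimension Gaussian is assumed here as a stand-in, and the function names `fit_channel_model`, `trust_score` and `detect_replay`, the feature vectors, and the threshold value are all hypothetical. The decision rule (score below the set threshold means replay) follows the abstract.

```python
import math
from statistics import fmean, pvariance


def fit_channel_model(training_features):
    """Step 201 (assumed form): fit one Gaussian per feature dimension."""
    dims = list(zip(*training_features))
    means = [fmean(d) for d in dims]
    # Floor the variance so a constant dimension cannot cause division by zero.
    variances = [max(pvariance(d), 1e-6) for d in dims]
    return means, variances


def trust_score(model, features):
    """Step 202 (assumed form): average per-frame log-likelihood under the model."""
    means, variances = model
    total = 0.0
    for frame in features:
        for x, m, v in zip(frame, means, variances):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(features)


def detect_replay(model, features, threshold):
    """Return True if the speech is judged to be replayed (score < threshold)."""
    return trust_score(model, features) < threshold


# Hypothetical two-dimensional feature frames from the reserved training voice.
training = [[0.0, 1.0], [0.2, 0.8], [-0.1, 1.1], [0.1, 0.9]]
model = fit_channel_model(training)

close_to_model = [[0.05, 0.95]]   # frames resembling the training channel
far_from_model = [[3.0, -2.0]]    # frames from a very different channel

print(detect_replay(model, close_to_model, threshold=0.0))
print(detect_replay(model, far_from_model, threshold=0.0))
```

In practice the threshold would be tuned on held-out genuine and replayed recordings; the value 0.0 above is chosen only to make the toy example separable.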

Embodiment 2

[0068] Refer to Figure 5, which shows a flow chart of the voice playback detection method described in Embodiment 2 of the present application. The method specifically includes:

[0069] Step 501: Establish a user channel model according to the reserved training voice of the target user.

[0070] Step 501 includes the following sub-steps:

[0071] Sub-step 5011: Calculate the sum of the squares of the sampling values of the current reserved training speech segment to obtain the energy of that segment. If the energy is lower than the set threshold, the segment is determined to be a low-energy speech segment.

[0072] Sub-step 5012: extract the low-energy speech segment of the target user's reserved training speech segment.

[0073] Extract the reserved training speech of the target user to obtain the low-energy speech segment of the reserved training speech, and use the short-term energy algorithm to detect the low-energy speech seg...
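
The energy test in Sub-step 5011 and the extraction in Sub-step 5012 can be sketched as below. The energy definition (sum of squared sample values compared with a set threshold) comes from the text; the fixed segment length, the threshold value, and the function names `segment_energy` and `low_energy_segments` are assumptions made for illustration.

```python
def segment_energy(samples):
    """Energy of a speech segment: the sum of the squares of its sample values."""
    return sum(s * s for s in samples)


def low_energy_segments(samples, segment_length, threshold):
    """Split the signal into consecutive segments and keep the low-energy ones.

    The final segment may be shorter than segment_length if the signal
    length is not an exact multiple of it.
    """
    segments = [
        samples[i:i + segment_length]
        for i in range(0, len(samples), segment_length)
    ]
    return [seg for seg in segments if segment_energy(seg) < threshold]


# Hypothetical signal: a quiet segment, a loud segment, then another quiet one.
signal = [0.1, -0.1, 0.9, -0.8, 0.05, 0.0]
quiet = low_energy_segments(signal, segment_length=2, threshold=0.1)
print(quiet)
```

Real systems usually compute short-time energy over overlapping windowed frames; non-overlapping segments are used here only to keep the sketch short.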

Embodiment 3

[0143] See Figure 7, which shows a structural block diagram of a voice playback detection device in Embodiment 3 of the present application. The device may specifically include: a user channel module 701, configured to establish a user channel model according to the reserved training voice of the target user.

[0144] Calculation module 702, configured to calculate the trust score of the speech to be recognized on the channel model of the target user.

[0145] The first judging module 703 is configured to determine that the speech to be recognized is replayed if the trust score is less than the set threshold, and return authentication failure; otherwise, pass the replay detection.

[0146] Preferably, the user channel module includes: a first extraction module, configured to extract a low-energy speech segment of the target user's reserved training speech.

[0147] The multi-composite acoustic feature module is used to extract the multi-composite acoustic features of the low-energy speech segme...

Abstract

The application discloses a voice playback detection method and device. The method comprises: establishing a user channel model according to the reserved training voice of a target user; calculating the trust score of the to-be-recognized voice on the user channel model; if the trust score is less than a set threshold, determining that the to-be-recognized voice is replayed and returning authentication failure; otherwise, passing the playback detection. This solves the voice playback attack problem in current voiceprint recognition technology.

Description

Technical field

[0001] The present application relates to the technical field of computer information services, in particular to a voice playback detection method and device.

Background

[0002] Speaker recognition technology, also known as voiceprint recognition technology, uses the speaker-specific characteristics contained in the voice, together with computers and various information recognition technologies, to confirm the speaker's identity automatically.

[0003] In recent years, with the rapid development of the Internet, voice has served as a non-contact information carrier: people can rely on various mobile terminal devices, such as mobile phones, microphones and IP phones, to complete voice collection anytime and anywhere, and realize human-computer interaction and speaker identification through network transmission and background servers.

[0004] With the advent of the mobile Internet era, while providing co...

Application Information

IPC(8): G10L25/60, G10L15/06, G10L15/01, G10L19/02

Inventor: 郑方, 李蓝天, 邬晓钧, 王小钢, 刘乐

Owner TSINGHUA UNIV
