Voice playback detection method and device
A voice playback detection method and device, applied in voice analysis, voice recognition, instruments, etc., which address the problem of recorded voice being replayed and thereby help avoid replay attacks.
Examples
Embodiment 1
[0051] Refer to Figure 2, which shows a flow chart of the voice playback detection method described in Embodiment 1 of the present application. The method specifically includes:
[0052] Step 201: Establish a user channel model according to the reserved training voice of the target user.
[0053] The reserved training voice of the target user is acquired in advance, and a user channel model is established according to the acquired reserved training voice of the target user.
[0054] The reserved training voice can be obtained from the back-end server or the client of the target user, or in other ways, which this application does not specifically limit.
[0055] Step 202: Calculate the trust score of the speech to be recognized on the user channel model.
[0056] This application uses the user channel model to score the speech to be recognized that is input by the user terminal, obtain the trust score of that speech, and judge whe...
Embodiment 2
[0068] Refer to Figure 5, which shows a flow chart of the voice playback detection method described in Embodiment 2 of the present application. The method specifically includes:
[0069] Step 501: Establish a user channel model according to the reserved training voice of the target user.
[0070] Step 501 includes the following sub-steps:
[0071] Sub-step 5011: Calculate the sum of the squares of the sampling values of the current reserved training speech segment to obtain the energy of that segment. If the energy is lower than the set threshold, the segment is determined to be a low-energy speech segment.
[0072] Sub-step 5012: Extract the low-energy speech segment of the target user's reserved training speech segment.
[0073] Extract the reserved training speech of the target user to obtain the low-energy speech segment of the reserved training speech, and use the short-term energy algorithm to detect the low-energy speech seg...
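The energy test in Sub-steps 5011 and 5012 can be illustrated with a minimal Python sketch. The function names, the non-overlapping segmentation, and the threshold value are assumptions made for illustration only; the source specifies just the sum-of-squares energy and the comparison against a set threshold.

```python
from typing import List, Sequence


def segment_energy(samples: Sequence[float]) -> float:
    """Energy of one segment: the sum of the squares of its sample values."""
    return sum(s * s for s in samples)


def split_into_segments(signal: Sequence[float], segment_len: int) -> List[List[float]]:
    """Cut the signal into consecutive, non-overlapping segments.

    ASSUMPTION: the source does not say how segments are framed; a trailing
    remainder shorter than segment_len is simply dropped here.
    """
    return [
        list(signal[i:i + segment_len])
        for i in range(0, len(signal) - segment_len + 1, segment_len)
    ]


def low_energy_segments(
    signal: Sequence[float], segment_len: int, energy_threshold: float
) -> List[List[float]]:
    """Keep only the segments whose energy is lower than the set threshold."""
    return [
        seg
        for seg in split_into_segments(signal, segment_len)
        if segment_energy(seg) < energy_threshold
    ]


if __name__ == "__main__":
    # Hypothetical toy signal: one loud segment followed by one quiet segment.
    signal = [0.5, -0.5, 0.5, -0.5, 0.01, -0.02, 0.01, 0.0]
    print(low_energy_segments(signal, segment_len=4, energy_threshold=0.1))
    # prints [[0.01, -0.02, 0.01, 0.0]]
```

In the toy signal the first segment has energy 1.0 and is discarded, while the second has energy 0.0006 and is kept as a low-energy segment.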
Embodiment 3
[0143] See Figure 7, which shows a structural block diagram of a voice playback detection device in Embodiment 3 of the present application. The device may specifically include: a user channel module 701, configured to establish a user channel model according to the reserved training voice of the target user.
[0144] Calculation module 702, configured to calculate the trust score of the speech to be recognized on the channel model of the target user.
[0145] The first judging module 703 is configured to determine that the speech to be recognized is replayed if the trust score is less than the set threshold, and return authentication failure; otherwise, pass the replay detection.
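The interplay of modules 701 to 703 can be sketched in Python as follows. This is only an illustration under stated assumptions: the source does not say what form the user channel model takes, so a diagonal Gaussian over feature vectors and an average log-likelihood score are used here purely as stand-ins, and all function names and the threshold value are hypothetical. Only the final decision rule (a score below the set threshold means replay) comes directly from the text.

```python
import math
from typing import List, Sequence, Tuple

# (per-dimension mean, per-dimension variance) of the stand-in model
Model = Tuple[List[float], List[float]]


def build_user_channel_model(train_features: Sequence[Sequence[float]]) -> Model:
    """Module 701 stand-in. ASSUMPTION: a diagonal Gaussian, not from the source."""
    n = len(train_features)
    dims = len(train_features[0])
    means = [sum(f[d] for f in train_features) / n for d in range(dims)]
    variances = [
        # Floor the variance so the log-likelihood stays finite.
        max(sum((f[d] - means[d]) ** 2 for f in train_features) / n, 1e-6)
        for d in range(dims)
    ]
    return means, variances


def trust_score(model: Model, features: Sequence[Sequence[float]]) -> float:
    """Module 702 stand-in: average log-likelihood per feature vector (ASSUMPTION)."""
    means, variances = model
    total = 0.0
    for f in features:
        for x, m, v in zip(f, means, variances):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(features)


def passes_replay_detection(score: float, threshold: float) -> bool:
    """Module 703: a score below the threshold is judged as replay (fail)."""
    return score >= threshold


if __name__ == "__main__":
    # Hypothetical 2-dimensional features from the reserved training voice.
    model = build_user_channel_model(
        [[0.0, 0.0], [0.2, -0.2], [-0.2, 0.2], [0.1, 0.1]]
    )
    threshold = 0.0  # hypothetical set threshold
    for name, feats in [("close to training", [[0.05, 0.0]]),
                        ("far from training", [[1.0, 1.0]])]:
        score = trust_score(model, feats)
        result = "pass" if passes_replay_detection(score, threshold) else "replay"
        print(name, result)
```

With this toy data, features close to the training features score above the threshold and pass, while features far from them score below it and are flagged as replay.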
[0146] Preferably, the user channel module includes: a first extraction module, configured to extract a low-energy speech segment of the target user's reserved training speech.
[0147] The multi-composite acoustic feature module is used to extract the multi-composite acoustic features of the low-energy speech segme...