A method and system for counterfeit audio authentication
By framing and windowing the audio data, applying a short-time Fourier transform, and fusing multimodal features, the method captures long-range feature dependencies in the audio and constructs a dual discrimination metric. This addresses the insufficient accuracy of existing techniques in identifying counterfeit audio and enables efficient, accurate identification.
CN122090874B · Active · Publication Date: 2026-06-26 · HANG ZHOU LING XIN SHU KE XIN XI JI SHU YOU XIAN GONG SI
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents (China)
- Current Assignee / Owner
- HANG ZHOU LING XIN SHU KE XIN XI JI SHU YOU XIAN GONG SI
- Filing Date
- 2026-04-21
- Publication Date
- 2026-06-26
Abstract
The present application relates to the technical field of audio discrimination and discloses a counterfeit audio discrimination method and system. The method comprises: framing and windowing original audio data to obtain frame-sequence audio data; performing a short-time Fourier transform on the frame-sequence audio data to obtain frequency-domain representation data and an initial acoustic feature vector; performing deep feature reconstruction on the initial acoustic feature vector, and performing multimodal feature fusion on the reconstructed feature tensor and the frame-sequence audio data to obtain a joint discrimination feature set; performing temporal context analysis on the joint discrimination feature set to capture feature dependencies in the frame-sequence audio data and obtain a context-enhanced feature sequence; quantifying the cosine similarity between the context-enhanced feature sequence and a genuine voiceprint template, and analyzing the feature mutation amplitude of the context-enhanced feature sequence, to obtain a dual discrimination metric; and comprehensively scoring the dual discrimination metric to obtain an audio discrimination conclusion. The present application can improve the efficiency of counterfeit audio discrimination.
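The abstract names the processing steps but does not give the window function, the network architectures, or the scoring formula. The following is only a minimal sketch of how the signal-processing and scoring steps could fit together, under several assumptions that are not from the patent: a Hann window, raw magnitude spectra standing in for the context-enhanced feature sequence (the deep reconstruction, fusion, and context-analysis stages are skipped), the mean consecutive-frame distance as the "mutation amplitude", and a hypothetical weighted sum `dual_metric_score` as the comprehensive score. All function names are illustrative.

```python
import cmath
import math


def frame_and_window(samples, frame_len, hop):
    """Split samples into overlapping frames and apply a Hann window to each."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    return [[samples[start + n] * window[n] for n in range(frame_len)]
            for start in range(0, len(samples) - frame_len + 1, hop)]


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 of one frame (one STFT column)."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame)))
            for k in range(n // 2 + 1)]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0


def mean_vector(features):
    """Element-wise mean over a sequence of equal-length feature vectors."""
    return [sum(col) / len(features) for col in zip(*features)]


def mutation_amplitude(features):
    """Assumed definition: mean distance between consecutive frames,
    relative to the mean frame norm."""
    if len(features) < 2:
        return 0.0
    jumps = [math.dist(a, b) for a, b in zip(features, features[1:])]
    mean_norm = sum(math.hypot(*f) for f in features) / len(features)
    return (sum(jumps) / len(jumps)) / mean_norm if mean_norm > 0 else 0.0


def dual_metric_score(features, template, w_sim=0.5):
    """Hypothetical scoring rule: high similarity and low mutation raise the score."""
    sim = cosine_similarity(mean_vector(features), template)
    mut = mutation_amplitude(features)
    score = w_sim * sim + (1.0 - w_sim) / (1.0 + mut)
    return sim, mut, score


def tone(bin_index, length, frame_len=64):
    """Synthetic sine centred on one DFT bin, used only as toy input."""
    return [math.sin(2 * math.pi * bin_index * n / frame_len)
            for n in range(length)]


steady = tone(8, 256)                      # one consistent "voice"
spliced = tone(8, 128) + tone(20, 128)     # abrupt change halfway through
template = mean_vector([magnitude_spectrum(f)
                        for f in frame_and_window(steady, 64, 32)])

for name, signal in (("steady", steady), ("spliced", spliced)):
    feats = [magnitude_spectrum(f) for f in frame_and_window(signal, 64, 32)]
    sim, mut, score = dual_metric_score(feats, template)
    print(f"{name}: similarity={sim:.3f} mutation={mut:.3f} score={score:.3f}")
```

In this toy setup the spliced signal both drifts away from the template and shows a jump between consecutive frames, so it scores lower than the steady one. A real system would replace the magnitude spectra with learned features and calibrate the weights and decision threshold on labelled data.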