Method and apparatus for determining counterfeit video on basis of multi-modal features, and device and medium

By extracting frame and audio features from videos, dynamically adjusting weights, and performing multi-level anomaly detection, the problem of low accuracy in detecting forged videos in existing technologies is solved, and efficient and accurate identification of complex forged videos is achieved.

WO2026129745A1PCT designated stage Publication Date: 2026-06-25CHINA TELECOM ARTIFICIAL INTELLIGENCE TECHNOLOGY (BEIJING) CO LTD

Patent Information

Authority / Receiving Office
WO · WO
Patent Type
Applications
Current Assignee / Owner
CHINA TELECOM ARTIFICIAL INTELLIGENCE TECHNOLOGY (BEIJING) CO LTD
Filing Date
2025-09-05
Publication Date
2026-06-25

Smart Images

  • Figure CN2025119425_25062026_PF_FP_ABST
    Figure CN2025119425_25062026_PF_FP_ABST
Patent Text Reader

Abstract

The present application relates to the technical field of artificial intelligence. Disclosed are a method and apparatus for determining a counterfeit video on the basis of multi-modal features, and a device and a medium. The method comprises: extracting picture frames and audio from a video; identifying objects from the picture frames, and then determining picture features of the objects; identifying audio features of the audio; on the basis of the picture features and the audio features, dynamically outputting a first weight, which relates to temporal synchronization between the picture frames and the audio, a second weight, which relates to spatio-temporal consistency between the picture frames and the audio, and a third weight, which relates to detecting whether features at different levels are abnormal, and extracting a time-series feature of the video; and on the basis of the first weight, the second weight, the third weight and the time-series feature, determining whether the video is counterfeit.
Need to check novelty before this filing date? Find Prior Art