A behavior recognition method, device, apparatus and storage medium

By fusing the initial motion vector difference of the video stream and image data obtained in streaming media behavior detection, the problems of long recognition cycle and resource redundancy in the existing technology are solved, realizing real-time streaming behavior recognition and efficient sharing of computing resources.

CN116994333BActive Publication Date: 2026-06-19CHINA MOBILE CHENGDU INFORMATION & TELECOMM TECH CO LTD +1

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
CHINA MOBILE CHENGDU INFORMATION & TELECOMM TECH CO LTD
Filing Date
2023-07-25
Publication Date
2026-06-19

AI Technical Summary

Technical Problem

Existing streaming media behavior detection methods require the acquisition of keyframes before decoding and encoding, resulting in long recognition cycles and an inability to achieve real-time streaming recognition. Furthermore, the lack of resource sharing leads to redundancy and delays in the recognition process.

Method used

By acquiring the initial motion vector difference of video stream data and image data, and performing fusion processing, end-to-end behavior recognition is achieved using deconvolution decoding and convolution processing, sharing computing resources and reducing unnecessary encoding and decoding steps.

🎯Benefits of technology

It achieves real-time processing and real-time output of behavior recognition, reducing the waste of computing resources, improving recognition efficiency, and reducing recognition time.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
Patent Text Reader

Abstract

This invention discloses a behavior recognition method, apparatus, device, and storage medium. The method includes: obtaining an initial motion vector difference between a first video frame and a second video frame based on video stream data corresponding to a behavior to be recognized; the first video frame being the preceding video frame of the second video frame; if the behavior to be recognized satisfies preset conditions, obtaining first image data corresponding to the behavior to be recognized that satisfies the preset conditions; fusing the initial motion vector difference and the first image data to obtain target information; and performing behavior recognition on the target information to obtain a recognition result corresponding to the behavior to be recognized.
Need to check novelty before this filing date? Find Prior Art