Video content positioning method based on feature fusion and cascade learning
A feature fusion and video content technology, applied in the field of machine vision and deep learning, can solve the problems of loss of complementary high-level semantic information, difficulty in realizing accurate positioning of video content, and reducing the accuracy of video content positioning
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0042] Below in conjunction with accompanying drawing, further describe the present invention through embodiment, but do not limit the scope of the present invention in any way.
[0043] The invention provides a video content location method based on feature fusion. Based on feature fusion and cascaded learning, cascaded neural networks are used to perform video feature extraction, feature fusion, and content location, which can solve the problem of video images and sounds. Complementary high-level semantic information loss problem, to achieve precise positioning of video content.
[0044] Such as figure 1 As shown, the video content positioning method based on feature fusion and cascade learning according to the present invention is used to accurately position the video. The video includes features of multiple modalities, such as image, sound, and optical flow. Assume that only sound and RGB image are used this time, so the following n is 2; the specific implementation incl...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


