Video scene detecting and labeling method and system

A video scene and video technology, applied in the field of video processing, can solve the problems of unsuitable video scene variable length duration, large amount of calculation consumption, poor generalization ability, etc., to solve the problem of multi-label labeling, error propagation and Effect of huge computational cost problem

Pending Publication Date: 2022-04-12
XI AN JIAOTONG UNIV
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But this will face the following three problems: 1) error propagation
2) Huge amount of calculation
If the scene detection is directly based on the frame features, it will consume a lot of computation during the training process
3) Poor generalization ability
Matrix inference requires a fixed length in the time dimension, which is not suitable for annotating variable-length durations of video scenes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video scene detecting and labeling method and system
  • Video scene detecting and labeling method and system
  • Video scene detecting and labeling method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The present invention is described in further detail below in conjunction with accompanying drawing:

[0029] In order to enable those skilled in the art to better understand the solutions of the present invention, the following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is an embodiment of a part of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts shall fall within the protection scope of the present invention.

[0030] It should be noted that the terms "comprising" and "having" and any variations thereof are intended to cover a non-exclusive inclusion, for example, a process, method, system, product or device comprising a series of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a video scene detection labeling method and system, and the method comprises the steps: obtaining the modal features of a video, an audio and a text through a pre-training model according to a modal information source embedded by the input video, the audio and the text, carrying out the alignment and fusion of the obtained modal features of the video, the audio and the text, and forming a window basic cross-modal representation, according to the multi-temporal attention and the difference between the adjacent windows, the basic cross-modal representation of the windows is evolved into self-adaptive context sensing representation, the scene is detected according to the obtained self-adaptive context sensing representation, and the attributes of the windows are determined through a window attribute classifier; obtaining an accurate position of a scene boundary in the window through a position offset regression device; and based on the obtained scene boundaries, specifying a plurality of labels for each scene to realize scene labeling, attributing scene detection into window attribute classification and position offset regression, and solving the multi-label labeling problem through integrated learning of two-stage classifiers. The problems of error propagation and huge calculation cost are solved through a unified network of cross-modal clues; scene detection is attributed to window attribute classification and position offset regression, and the multi-label labeling problem is solved through ensemble learning of two-stage classifiers.

Description

technical field [0001] The invention belongs to the field of video processing, and in particular relates to a video scene detection and labeling method and system. The invention is based on the video scene detection and labeling of a multi-modal adaptive context network. Background technique [0002] With the rapid development of 5G technology, video advertising has seen a huge growth in short video applications. In the creative, distribution, and strategy of the advertising ecosystem, a deep understanding of its content is becoming more and more important and demanding. As a key step in the semantic understanding of video advertisements, scene detection and annotation aims to temporally parse a video into different scenes and predict each scene to be annotated in different dimensions, such as presentation form, style, and location. Various potential applications have emerged, including video ad insertion, video summarization, video indexing and retrieval, etc. [0003] In...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06V20/40G06N3/08
Inventor 徐亦飞桑维光罗海伦李斌徐武将朱利
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products