Unlock instant, AI-driven research and patent intelligence for your innovation.

Transformer-Based Video Summarization Approach

A technology of video summarization and converter, which is applied in the field of video summarization based on converters, can solve problems such as the difficulty of capturing long-term dependencies, missing sequence dependencies, and difficult training of cyclic neural networks, and achieve fast training timeliness and sequence information Complete, model-simple effect

Inactive Publication Date: 2021-05-18
NORTHWESTERN POLYTECHNICAL UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, two LSTM networks are used in the literature "Ke Zhang, Wei-Lun Chao, FeiSha, et al. Video Summarization with Long Short-Term Memory[C] / / European Conference on Computer Vision. Springer, Cham, 2016.", one from Front to back, one extracts the sequence information of the video frame from the back to the front and predicts the importance score of the video frame. The network structure is simple and can extract key sequence information, but it is difficult for the cyclic neural network to capture long-term dependencies. When processing long video information, it is easy to lose early sequence dependencies; the document "Ji, Zhong, Xiong, Kailin, Pang, Yanwei, etal.Video Summarization with Attention-Based Encoder-Decoder Networks[J].2017." uses codec The codec structure is used to extract video key frames. Although the attention mechanism has been added and good results have been achieved, the codec still uses the LSTM network, and its complexity is related to the length of the video. There are problems of difficulty in parallel training and time-consuming

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Transformer-Based Video Summarization Approach
  • Transformer-Based Video Summarization Approach
  • Transformer-Based Video Summarization Approach

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The present invention will be further described below in conjunction with the accompanying drawings and embodiments, and the present invention includes but not limited to the following embodiments.

[0019] Such as figure 1 As shown, the present invention provides a kind of converter-based video summarization method, and its specific implementation process is as follows:

[0020] 1. Data processing

[0021] Downsample the video in the selected dataset, and then use a pre-trained neural network to extract the feature vector h for each frame of the video f ∈R d , f is the frame number, f=1,2,...,F, F is the total length of the video after downsampling, d represents the length of the feature vector; the feature vectors of all frames of a video and the corresponding importance scores constitute training A sample in the set; the selected data set includes TvSum and SumMe, which contain several videos and the importance score s' of manual labeling for each frame f ;

[0...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a converter-based method for extracting video summaries. First, the selected data set is processed to obtain the training data set of the model; then, a video summarization converter neural network model including a self-attention mechanism is constructed, and the self-attention mechanism is used to calculate the similarity between video frames, and through Add the importance score of the previous frame to enhance the ability of the model to capture the global dependency of the video frame sequence, and use the training data set to train the model; finally, use the trained model to process the video data to be processed to obtain the importance of each frame Score, according to the score selection to get the video summary. The present invention can well capture the timing information between video frame sequences, and then can well predict the importance of video frames in the form of scores, and the model network of the present invention can parallelize frame sequences Training has the advantages of fast training timeliness and complete and short video summaries.

Description

technical field [0001] The invention belongs to the technical fields of computer vision and deep learning representation, and in particular relates to a converter-based video summarization method. Background technique [0002] With the rapid development of cameras and video sharing technologies, the number of videos is showing explosive growth. In the face of massive video data, how to efficiently extract useful information from videos has become an important issue. As an important technology to solve this problem, video summarization technology aims to generate a complete and short summary video for the original video. The summary video can convey the information expressed by the original video on the basis of short duration. Hot spots in vision and other fields. Video summary technology comprehensively uses machine learning, artificial intelligence and other technologies, and plays an important role in video retrieval, storage, recommendation and other aspects. [0003]...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04N21/8549G06N3/04G06N3/08
CPCG06N3/04G06N3/08H04N21/8549
Inventor 梁国强张艳宁吕艳兵李书成吉时雨
Owner NORTHWESTERN POLYTECHNICAL UNIV