Video description data processing method and device and storage medium

A video description and data processing technology, applied in the field of image processing, can solve problems such as inappropriateness, mismatch between training and testing, and reduce model accuracy, so as to improve accuracy and solve inconsistent data distribution.

Pending Publication Date: 2022-04-22
桂林远望智能通信科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] At present, the basic training model only uses real labels as data input during training, and only generates values ​​as input during testing, which causes a mismatch between training and testing.
At the same time, the descriptions and real labels generated by the existing basic models are often not appropriate enough, which reduces the accuracy of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video description data processing method and device and storage medium
  • Video description data processing method and device and storage medium
  • Video description data processing method and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The following principles and features of the present invention are described in conjunction with the accompanying drawings, the examples given are for illustrative purposes only and are not intended to limit the scope of the present invention.

[0022] Figure 1 A schematic diagram of a video description of a data processing method provided for an embodiment of the present invention.

[0023] as Figure 1 As shown, a video describes a data processing method, comprising the following steps:

[0024] S1: Import video data and build a video description model, the video description model includes an encoder and multiple sequentially arranged LSTM long short-term memory network;

[0025] S2: The video data is encoded by the encoder to obtain a visual feature matrix, the visual feature matrix comprises a plurality of visual feature vectors corresponding to the plurality of long short-term memory networks of LSTM, respectively;

[0026] S3: The real word vector corresponding to t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a video description data processing method and device and a storage medium, and belongs to the technical field of image processing, and the method comprises the steps: S1, importing video data, and constructing an encoder and a plurality of sequentially arranged LSTM (Long Short-Term Memory) networks; s2, encoding the video data through an encoder to obtain a visual feature vector; s3, real word vectors are imported, the LSTM long short-term memory networks, the visual feature vectors and the real word vectors are taken as one group, and each group of LSTM long short-term memory networks is sequentially judged and analyzed to obtain video description information; s4, performing loss analysis on the video description information to obtain a target video description model; and S5, importing to-be-tested video data, and performing video description on the to-be-tested video data through the target video description model to obtain a video description result. According to the method, the problem of inconsistent data distribution is solved, the generated words can be closer to real tags, and the accuracy of description generation is further improved.

Description

Technical field [0001] The present invention relates primarily to the field of image processing technology, specifically to a video description data processing method, apparatus and storage medium. Background [0002] At present, the basic training model will only use the real label as the data input when training, and the generated value can only be used as the input when testing, which causes a mismatch between training and testing. At the same time, the description and real labels generated by the existing basic model are often not appropriate enough, which reduces the accuracy of the model. Contents of the Invention [0003] The technical problem to be solved in the present invention is to provide a video description of the data processing method, apparatus and storage medium for the deficiencies of prior art. [0004] The present invention solves the technical solution of the above technical problems as follows: a video description of a data processing method, comprising t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/78G06F16/783G06N3/04G06N3/08
CPCG06F16/7867G06F16/7847G06N3/049G06N3/08G06N3/045
Inventor 蔡晓东王湘晴
Owner 桂林远望智能通信科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products