Video data augmentation method, device, electronic device and readable storage medium

A video data augmentation technology, applied in the fields of deep learning, computer vision, and artificial intelligence, that improves the model training effect and model performance, increases data diversity, and helps avoid overfitting.

Active Publication Date: 2022-05-03
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] In model training, current research mainly focuses on how to extract the spatio-temporal features of videos, for example by optimizing the design of the network structure and improving the ability to use spatio-temporal information, especially temporal information. The training strategy of the network, and video data augmentation in particular, is rarely mentioned or studied.

Examples

Embodiment Construction

[0028] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0029] In addition, it should be understood that the term "and/or" in this article merely describes an association between associated objects and indicates that three relationships are possible. For example, "A and/or B" may mean: A exists alone, A and B exist at the same time, or B exists alone. In addition, the character "/" in this article g...

Abstract

The disclosure provides a video data augmentation method, device, electronic equipment, and readable storage medium, and relates to artificial intelligence fields such as deep learning and computer vision. The method may include: during model training, for the M pieces of video data in any training batch, where M is a positive integer greater than one, performing the following processing: forming a first video sequence from the M pieces of video data in their original order; randomly reordering the M pieces of video data in the first video sequence to obtain a second video sequence; and mixing each piece of video data in the first video sequence with the corresponding piece of video data in the second video sequence to obtain M pieces of mixed video data for model training. Applying the solution described in the present disclosure can improve the model training effect and model performance.
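The batch-level procedure in the abstract can be illustrated with a minimal sketch. The abstract does not say how two videos are mixed, so the linear blend below, the Beta-distributed weight, the label mixing, and all names (`mix_video_batch`, `alpha`, the `(M, T, H, W, C)` layout) are assumptions made for illustration, not the method claimed in the patent.

```python
import numpy as np


def mix_video_batch(videos, labels, alpha=0.2, rng=None):
    """Mix each video in a batch with a randomly chosen partner.

    videos: float array of shape (M, T, H, W, C), assumed layout.
    labels: float array of shape (M, K), e.g. one-hot class labels.
    """
    rng = np.random.default_rng() if rng is None else rng
    m = videos.shape[0]
    if m < 2:
        raise ValueError("M must be an integer greater than one")

    # First video sequence: the batch in its original order.
    # Second video sequence: a random reordering of the first.
    perm = rng.permutation(m)
    second_videos = videos[perm]
    second_labels = labels[perm]

    # Assumed mixing rule: one blend weight per batch, drawn from
    # Beta(alpha, alpha), applied position by position.
    lam = rng.beta(alpha, alpha)
    mixed_videos = lam * videos + (1.0 - lam) * second_videos
    # Assumption: labels are blended with the same weight.
    mixed_labels = lam * labels + (1.0 - lam) * second_labels
    return mixed_videos, mixed_labels, perm, lam


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    batch = rng.random((4, 8, 16, 16, 3))  # M=4 clips of 8 frames
    one_hot = np.eye(4)
    mixed, mixed_labels, perm, lam = mix_video_batch(batch, one_hot, rng=rng)
    print(mixed.shape, perm, lam)
```

Because the second sequence is a permutation of the first, a video can occasionally be paired with itself, in which case that position is left unchanged by the blend. Whether the patented method excludes such pairings is not stated in the abstract.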

Description

Technical field

[0001] The present disclosure relates to the field of artificial intelligence technology, and in particular to video data augmentation methods, devices, electronic equipment, and readable storage media in the fields of deep learning and computer vision.

Background technique

[0002] Currently, video classification is usually performed with deep learning techniques; for example, video classification can be performed using a trained video classification model.

[0003] In model training, current research mainly focuses on how to extract the spatio-temporal features of videos, for example by optimizing the design of the network structure and improving the ability to use spatio-temporal information, especially temporal information. The training strategy of the network, and video data augmentation in particular, is rarely mentioned or studied.

Contents of the invention

[0004] The disclosure provides a video data au...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06V10/774, G06V20/40, G06K9/62
CPC: G06V20/41, G06F18/214
Inventor: 王世鹏, 黄军, 程军, 胡晓光
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD