Video data augmentation method, device, electronic device and readable storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A video data and video technology, applied in deep learning, computer vision, and artificial intelligence fields, to improve training effects and model performance, increase diversity, and avoid overfitting

Active Publication Date: 2022-05-03

BEIJING BAIDU NETCOM SCI & TECH CO LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] In model training, the current research mainly focuses on how to extract the spatio-temporal features of videos, such as optimizing the design of the network structure, improving the ability to use video spatio-temporal information, especially temporal information, etc., and the training strategy of the network is especially It is video data augmentation but rarely mentioned and researched

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0028] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0029] In addition, it should be understood that the term "and / or" in this article is only an association relationship describing associated objects, which means that there may be three relationships, for example, A and / or B may mean: A exists alone, and A exists at the same time. and B, there are three cases of B alone. In addition, the character " / " in this article g...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The disclosure discloses a video data augmentation method, device, electronic equipment, and readable storage medium, and relates to artificial intelligence fields such as deep learning and computer vision. The method may include: during the model training process, for any training batch M pieces of video data, M is a positive integer greater than one, and the following processing is performed respectively: M pieces of video data in the original sequence are used to form the first video sequence; the M pieces of video data in the first video sequence are randomly sorted to obtain the first video sequence Two video sequences: respectively mixing each video data in the first video sequence with corresponding video data in the second video sequence to obtain M mixed video data for model training. Applying the solution described in the present disclosure can improve the model training effect and model performance.

Description

technical field [0001] The present disclosure relates to the field of artificial intelligence technology, and in particular to video data augmentation methods, devices, electronic equipment, and readable storage media in the fields of deep learning and computer vision. Background technique [0002] Currently, when performing video classification, deep learning techniques are usually combined, for example, video classification can be performed using a trained video classification model. [0003] In model training, the current research mainly focuses on how to extract the spatio-temporal features of videos, such as optimizing the design of the network structure, improving the ability to use video spatio-temporal information, especially temporal information, etc., and the training strategy of the network is especially It is the aspect of video data augmentation, but it is rarely mentioned and studied. Contents of the invention [0004] The disclosure provides a video data au...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06V10/774G06V20/40G06K9/62

CPCG06V20/41G06F18/214

Inventor 王世鹏黄军程军胡晓光

Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD

Video data augmentation method, device, electronic device and readable storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology