Deep learning based video retrieval method

A deep learning and video technology, applied in the field of computer vision. It addresses problems such as the inaccurate description of differences between high-dimensional features, and achieves the effects of improved matching accuracy, precise retrieval, and avoidance of false detection and missed detection.

Active Publication Date: 2018-06-29
SOUTH CHINA UNIV OF TECH


Problems solved by technology

In addition, because video is a complex data structure, existing feature matching methods cannot accurately describe the differences between high-dimensional video features.




Embodiment Construction

[0029] The specific implementation of the present invention will be further described below in conjunction with the accompanying drawings, but the implementation and protection of the present invention are not limited thereto.

[0030] The embodiment of the present invention provides a video retrieval method based on deep learning, as shown in Figure 1. The concrete implementation steps of the method are as follows:

[0031] Network training part:

[0032] Step 1) Construct a video preprocessing network using the network structure of Inception Net V3. Inception Net is a deep convolutional neural network (the original GoogLeNet is 22 layers deep). The output of the last layer gives the best classification performance, so it is selected as the feature vector.
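The idea of taking a pretrained CNN's final-layer activations as a fixed-length frame descriptor can be sketched with a toy, single-layer stand-in. This is a minimal numpy illustration, not the patent's actual Inception Net V3; the kernel weights are random placeholders, and `extract_frame_feature` is a hypothetical helper name:

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def extract_frame_feature(frame, filters):
    # frame: (H, W) grayscale array; filters: (n_filters, k, k) conv kernels.
    n, k, _ = filters.shape
    h, w = frame.shape
    maps = np.zeros((n, h - k + 1, w - k + 1))
    for f in range(n):
        for i in range(h - k + 1):
            for j in range(w - k + 1):
                maps[f, i, j] = np.sum(frame[i:i + k, j:j + k] * filters[f])
    # Global average pooling over the last layer's activation maps yields a
    # fixed-length vector, analogous to taking the network's final-layer
    # output as the frame descriptor fed to the downstream sequence model.
    return relu(maps).mean(axis=(1, 2))

rng = np.random.default_rng(0)
frame = rng.random((16, 16))          # stand-in for one video frame
filters = rng.standard_normal((8, 3, 3))
feat = extract_frame_feature(frame, filters)
print(feat.shape)  # -> (8,)
```

In the real pipeline the descriptor would come from the 2048-dimensional top of a pretrained Inception network rather than an 8-filter toy layer, but the shape contract is the same: one frame in, one fixed-length vector out.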

[0033] Step 2) Train the video preprocessing network. Use YouTube-8M as the training data set, which has 8 million videos across a total of 4800 label categories. In order to e...



Abstract

The invention discloses a deep learning based video retrieval method which mainly comprises the following parts: performing video preprocessing with a convolutional neural network; extracting feature vectors from the preprocessed video with a long short-term memory network; and finally learning a distance calculation method through a similarity learning algorithm, then performing similarity calculation and ranking according to that method to obtain the video retrieval result. In the disclosed method, scene segmentation and key frame selection are performed by the convolutional neural network, and high-level semantics representing the video are extracted, so as to acquire a suitable number of key frame sequences and effectively avoid false detection and missed detection in shot segmentation. The temporal characteristics of the video are extracted by the long short-term memory network, yielding a more accurate retrieval result. Through similarity learning and a text-based matching method, the matching accuracy of the similarity measurement can be improved. By adopting the disclosed method, accurate retrieval over large-scale video collections can be realized.
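The final stage described in the abstract, scoring a query video against a database of feature vectors and ranking the results, can be sketched as follows. This is a minimal illustration using plain cosine similarity; the patent instead learns its distance function via similarity learning, and the clip names and `rank_videos` helper are hypothetical:

```python
import numpy as np

def cosine_similarity(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def rank_videos(query_vec, database):
    # database: {video_id: feature_vector}.
    # Returns video ids sorted by similarity to the query, best match first.
    scores = {vid: cosine_similarity(query_vec, vec)
              for vid, vec in database.items()}
    return sorted(scores, key=scores.get, reverse=True)

db = {
    "clip_a": np.array([1.0, 0.0, 0.0]),
    "clip_b": np.array([0.9, 0.1, 0.0]),
    "clip_c": np.array([0.0, 1.0, 0.0]),
}
query = np.array([1.0, 0.05, 0.0])
print(rank_videos(query, db))  # -> ['clip_a', 'clip_b', 'clip_c']
```

Swapping the fixed cosine measure for a learned distance (the similarity-learning step in the abstract) only changes `cosine_similarity`; the ranking loop is unchanged.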

Description

technical field

[0001] The invention belongs to the field of computer vision, and in particular relates to a video retrieval method utilizing deep learning and digital processing technology.

Background technique

[0002] In recent years, the Internet and multimedia technology have been widely developed and adopted, a trend that has swept the world. Massive amounts of video data are generated in people's daily life, work and study. Facing this explosive growth of multimedia data, we increasingly need a method that can accurately and effectively retrieve and manage massive video collections.

[0003] A complete video retrieval process usually includes three main steps: video preprocessing, that is, the process of removing redundant frames, including shot detection and key frame extraction; video feature extraction; and feature matching, that is, similarity calculation. In the field of video preprocessing, existing technologies mainly use the pixel difference method, histogram method and edge detection method to perf...
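The histogram method for shot detection mentioned in [0003] can be sketched as follows: compute a normalized grayscale histogram per frame and flag a shot boundary wherever the distance between consecutive histograms jumps. This is a minimal numpy sketch under simplifying assumptions (grayscale frames, an L1 distance, a fixed threshold); the helper names and the threshold value are illustrative, not from the patent:

```python
import numpy as np

def gray_histogram(frame, bins=16):
    # Normalized grayscale histogram of one frame (pixel values in [0, 255]).
    hist, _ = np.histogram(frame, bins=bins, range=(0, 256))
    return hist / hist.sum()

def shot_boundaries(frames, threshold=0.5):
    # Frame index i is flagged as a cut when the L1 distance between the
    # histograms of frames i-1 and i exceeds the threshold.
    cuts = []
    prev = gray_histogram(frames[0])
    for i in range(1, len(frames)):
        cur = gray_histogram(frames[i])
        if np.abs(cur - prev).sum() > threshold:
            cuts.append(i)
        prev = cur
    return cuts

# Synthetic clip: 5 dark frames then 5 bright frames -> one cut at index 5.
dark = [np.full((8, 8), 20, dtype=np.uint8) for _ in range(5)]
bright = [np.full((8, 8), 230, dtype=np.uint8) for _ in range(5)]
print(shot_boundaries(dark + bright))  # -> [5]
```

A fixed global threshold is exactly the weakness the background section alludes to: gradual transitions and lighting changes cause missed or false cuts, which motivates the patent's CNN-based scene segmentation instead.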


Application Information

IPC(8): G06F17/30; G06N3/04
CPC: G06F16/783; G06N3/045
Inventor: 丁泉龙, 廖奕铖, 韦岗, 李杰
Owner: SOUTH CHINA UNIV OF TECH