Convolutional neural network model-based violence and terrorism video detection method

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A convolutional neural network and video detection technology, applied in the fields of computer vision and machine learning, can solve the problems of limited description ability, low accuracy and recall rate of detection work, and difficulty in fully and accurately describing the content of video images, so as to improve accuracy and Recall rate, effect of excellent detection performance

Active Publication Date: 2017-05-31

XIAMEN UNIV

View PDF2 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The existing various video image feature descriptors (such as sift, gist, mser, hessian, etc.) have limited description capabilities, and it is difficult to fully and accurately describe the content of video images, especially in violent and terrorist videos that need to be detected for specific targets , resulting in relatively low precision and recall of the detection work

so in most cases only manual review

Manual review is time-consuming and laborious, which is very limited in the current era of information explosion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0029] The following embodiments will further illustrate the present invention in conjunction with the accompanying drawings.

[0030] For the correspondence relationship between each layer of the convolutional neural network and the various hierarchical features of the image in the embodiment of the present invention, see figure 1 , the network structure of the present invention:

[0031] There are nine layers in the network.

[0032] The input layer of the first layer inputs an RGB three-channel image.

[0033] The second layer is the convolutional layer. There are a total of 96 convolutional kernels. Each convolutional kernel has a size of 11×11 and a convolution step of 4. The output feature map is followed by an activation layer, a downsampling layer, and a normalization layer. layer. The activation layer activation function uses the ReLU function. The sampling method of the sampling layer is maximum sampling, the sampling kernel is 3×3, and the step size is 2. The n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a convolutional neural network model-based violence and terrorism video detection method, and relates to computer vision and machine learning. The method comprises the following steps of 1) training a deep neural network model; and 2) detecting violence and terrorism videos online. By utilizing a low-level feature of a deep learning model combination, a more abstract high-level representation attribute or feature is formed to discover distributed feature representation of data. Video image feature descriptors with good description capabilities can be obtained through the model. The feature descriptors cover feature information, at all levels from low to high, of video images, so that the detection precision and recall rate of violence and terrorism videos are greatly improved and increased. A deep convolutional network is trained through a small amount of samples to obtain excellent detection performance. The detection precision of terrorism pictures reaches more than 99%, and the recall rate of the terrorism pictures reaches more than 98%. The detection precision of terrorism videos reaches 95%, and the recall rate of the terrorism videos reaches 99%. The training process is free from artificial participation, and massive data is generated automatically according to a small amount of the samples.

Description

technical field [0001] The invention relates to computer vision and machine learning, in particular to a method for detecting violent and terrorist videos based on a convolutional neural network model. Background technique [0002] The rampant terrorism has brought great disaster and pain to people all over the world. It has caused heavy casualties and property losses, hindered national security and social stability, and its harmfulness far exceeds that of ordinary criminal crimes. One of the main channels through which terrorist ideas are disseminated is via video images. At present, the detection of terrorist videos is mainly through manual review and labeling, which consumes a lot of financial and material resources. If the video or image file is re-edited (such as remake, transcription), manual review is required again, which is very inefficient. Therefore, in the face of the Internet with an increasing amount of data, a new type of technology is needed to automatical...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/00G06K9/46G06N3/02

CPCG06N3/02G06V20/41G06V20/46G06V10/462

Inventor 纪荣嵘林贤明沈云航

Owner XIAMEN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Convolutional neural network model-based violence and terrorism video detection method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology