Convolutional neural network model-based violence and terrorism video detection method

A convolutional neural network and video detection technology, applied in the fields of computer vision and machine learning, can solve the problems of limited description ability, low accuracy and recall rate of detection work, and difficulty in fully and accurately describing the content of video images, so as to improve accuracy and Recall rate, effect of excellent detection performance

Active Publication Date: 2017-05-31
XIAMEN UNIV
View PDF2 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The existing various video image feature descriptors (such as sift, gist, mser, hessian, etc.) have limited description capabilities, and it is difficult to fully and accurately describe the content of video images, especially in violent and terrorist videos

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolutional neural network model-based violence and terrorism video detection method
  • Convolutional neural network model-based violence and terrorism video detection method
  • Convolutional neural network model-based violence and terrorism video detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The following embodiments will further illustrate the present invention in conjunction with the accompanying drawings.

[0030] For the correspondence relationship between each layer of the convolutional neural network and the various hierarchical features of the image in the embodiment of the present invention, see figure 1 , the network structure of the present invention:

[0031] There are nine layers in the network.

[0032] The input layer of the first layer inputs an RGB three-channel image.

[0033] The second layer is the convolutional layer. There are a total of 96 convolutional kernels. Each convolutional kernel has a size of 11×11 and a convolution step of 4. The output feature map is followed by an activation layer, a downsampling layer, and a normalization layer. layer. The activation layer activation function uses the ReLU function. The sampling method of the sampling layer is maximum sampling, the sampling kernel is 3×3, and the step size is 2. The n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a convolutional neural network model-based violence and terrorism video detection method, and relates to computer vision and machine learning. The method comprises the following steps of 1) training a deep neural network model; and 2) detecting violence and terrorism videos online. By utilizing a low-level feature of a deep learning model combination, a more abstract high-level representation attribute or feature is formed to discover distributed feature representation of data. Video image feature descriptors with good description capabilities can be obtained through the model. The feature descriptors cover feature information, at all levels from low to high, of video images, so that the detection precision and recall rate of violence and terrorism videos are greatly improved and increased. A deep convolutional network is trained through a small amount of samples to obtain excellent detection performance. The detection precision of terrorism pictures reaches more than 99%, and the recall rate of the terrorism pictures reaches more than 98%. The detection precision of terrorism videos reaches 95%, and the recall rate of the terrorism videos reaches 99%. The training process is free from artificial participation, and massive data is generated automatically according to a small amount of the samples.

Description

technical field [0001] The invention relates to computer vision and machine learning, in particular to a method for detecting violent and terrorist videos based on a convolutional neural network model. Background technique [0002] The rampant terrorism has brought great disaster and pain to people all over the world. It has caused heavy casualties and property losses, hindered national security and social stability, and its harmfulness far exceeds that of ordinary criminal crimes. One of the main channels through which terrorist ideas are disseminated is via video images. At present, the detection of terrorist videos is mainly through manual review and labeling, which consumes a lot of financial and material resources. If the video or image file is re-edited (such as remake, transcription), manual review is required again, which is very inefficient. Therefore, in the face of the Internet with an increasing amount of data, a new type of technology is needed to automatical...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/46G06N3/02
CPCG06N3/02G06V20/41G06V20/46G06V10/462
Inventor 纪荣嵘林贤明沈云航
Owner XIAMEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products