Audio and video processing method, device, equipment and medium

An audio and video processing technology, applied in the computer field, that addresses problems such as increased labor costs, low accuracy, and limitations, and achieves the effects of reducing labor costs, improving accuracy and recall, and improving the experience.

Inactive Publication Date: 2019-01-22
GUANGZHOU BAIGUOYUAN INFORMATION TECH CO LTD

AI Technical Summary

Problems solved by technology

However, the accuracy of short video tag classification is limited by the performance of the algorithm. If the algorithm performance is relatively poor, th

Example Embodiment

[0057] The present invention will be further described in detail below with reference to the drawings and embodiments. It can be understood that the specific embodiments described here are only used to explain the present invention, but not to limit the present invention. In addition, it should be noted that, for ease of description, the drawings only show parts related to the present invention instead of all the structures or components.

[0058] The prior art uses three-dimensional convolution for label classification of video content. This requires transforming a two-dimensional convolutional neural network that processes a single picture, such as a convolutional neural network used directly for image classification, into a three-dimensional convolutional neural network that can process multiple pictures. However, three-dimensional convolution makes the number of network parameters very large, which makes the network difficult to train; that is, there is a problem that the ...
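The parameter growth mentioned above can be illustrated with a small calculation. This is a minimal sketch, not taken from the patent: the function names and the layer sizes (64 input channels, 64 output channels, 3x3 spatial kernel, temporal depth 3) are assumptions chosen only for illustration.

```python
def conv2d_params(c_in: int, c_out: int, k: int, bias: bool = True) -> int:
    """Number of learnable parameters in a 2D convolution with a k x k kernel."""
    return c_in * c_out * k * k + (c_out if bias else 0)


def conv3d_params(c_in: int, c_out: int, k: int, k_t: int, bias: bool = True) -> int:
    """Number of learnable parameters in a 3D convolution with a k_t x k x k kernel,
    where k_t is the kernel extent along the time (frame) axis."""
    return c_in * c_out * k * k * k_t + (c_out if bias else 0)


# Hypothetical layer sizes, for illustration only.
params_2d = conv2d_params(c_in=64, c_out=64, k=3)
params_3d = conv3d_params(c_in=64, c_out=64, k=3, k_t=3)
print(params_2d)  # 36928
print(params_3d)  # 110656
```

Under these assumed sizes, a single layer's weight count grows by a factor equal to the temporal kernel depth, and the increase accumulates across every layer of the network.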

Abstract

The invention discloses an audio and video processing method, device, equipment and medium, relating to the field of computer technology. The method comprises: separating image frame information and audio information from a video file; extracting image feature information and audio feature information from the image frame information and the audio information, respectively; fusing the image feature information and the audio feature information into video content feature information; and determining a classification result corresponding to the video file according to the video content feature information. The invention combines the audio feature information in the video with the image feature information of the video frames for video classification, which improves the accuracy and recall rate of video classification and thereby reduces the labor cost of video classification review.
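The four steps in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: it assumes the image frames and the audio waveform have already been separated from the video file into NumPy arrays, it uses simple hand-crafted statistics as stand-ins for the image and audio feature extractors, it assumes fusion by concatenation, and it uses a linear softmax classifier with hypothetical weights. All function names are made up for this example.

```python
import numpy as np


def extract_image_features(frames: np.ndarray) -> np.ndarray:
    """Stand-in image features: per-channel mean and standard deviation.
    frames has shape (num_frames, height, width, 3)."""
    pixels = frames.reshape(-1, frames.shape[-1]).astype(np.float64)
    return np.concatenate([pixels.mean(axis=0), pixels.std(axis=0)])


def extract_audio_features(audio: np.ndarray, n_bands: int = 8) -> np.ndarray:
    """Stand-in audio features: log energy in n_bands frequency bands."""
    spectrum = np.abs(np.fft.rfft(audio.astype(np.float64))) ** 2
    bands = np.array_split(spectrum, n_bands)
    return np.log1p(np.array([band.sum() for band in bands]))


def fuse_features(image_feat: np.ndarray, audio_feat: np.ndarray) -> np.ndarray:
    """Assumed fusion: concatenate both vectors into one content feature vector."""
    return np.concatenate([image_feat, audio_feat])


def classify(fused: np.ndarray, weights: np.ndarray, bias: np.ndarray):
    """Linear softmax classifier; returns (predicted class index, probabilities)."""
    logits = weights @ fused + bias
    logits = logits - logits.max()  # subtract the max for numerical stability
    probs = np.exp(logits) / np.exp(logits).sum()
    return int(np.argmax(probs)), probs


# Synthetic stand-ins for the separated image frames and audio track.
rng = np.random.default_rng(0)
frames = rng.integers(0, 256, size=(4, 8, 8, 3))
audio = rng.standard_normal(1600)

fused = fuse_features(extract_image_features(frames), extract_audio_features(audio))
weights = rng.standard_normal((3, fused.size))  # hypothetical, untrained weights
label, probs = classify(fused, weights, np.zeros(3))
print(fused.shape, label)
```

In a real system the stand-in extractors would be replaced by learned models, and the classifier weights would be trained rather than drawn at random.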

Description

Technical Field

[0001] The present invention relates to the field of computer technology, in particular to an audio and video processing method, device, equipment and medium.

Background Technique

[0002] With the rapid development of computer technology, deep learning technology has made great progress in many fields of image understanding. For example, deep learning technology is applied to tasks such as object classification, object detection, and object segmentation in images. So far, deep learning technology has been very mature in the field of image understanding, and has been gradually applied to video content understanding tasks. However, compared with image content understanding, video content understanding still has a long way to go. In video content understanding tasks, video classification is the most basic task, and the field of video classification has become a hotspot for many researchers.

[0003] Specifically, video classification is mainly to classify vid...

Application Information

IPC(8): H04N21/234, H04N21/239, H04N21/439, H04N21/44, H04N21/466
CPC: H04N21/23418, H04N21/239, H04N21/4394, H04N21/44008, H04N21/4665
Inventor 刘文奇, 刘运, 梁柱锦
Owner GUANGZHOU BAIGUOYUAN INFORMATION TECH CO LTD