Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for joint video and audio quality assessment based on neural network

A neural network and quality evaluation technology, applied in the field of multimedia quality evaluation, can solve the problems of lack of objective audio and video joint quality evaluation model, no data collected, no description or report found, etc.

Active Publication Date: 2021-05-07
SHANGHAI JIAOTONG UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Generally speaking, in order to study the interaction between audiovisual signals and other factors that affect the evaluation of audiovisual quality, it is usually necessary to conduct basic research on multimodal perception, and these studies are usually achieved through some audiovisual experiments, while the current field Intrinsic and objective audio-video joint quality evaluation models are extremely scarce
[0010] At present, there is no description or report of the similar technology of the present invention, and no similar data at home and abroad have been collected yet.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for joint video and audio quality assessment based on neural network
  • Method and device for joint video and audio quality assessment based on neural network
  • Method and device for joint video and audio quality assessment based on neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The following is a detailed description of the embodiments of the present invention: this embodiment is implemented on the premise of the technical solution of the present invention, and provides detailed implementation methods and specific operation processes. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention, and these all belong to the protection scope of the present invention.

[0047] like figure 1 As shown, an overall flow chart of a neural network-based video and audio joint quality evaluation method is provided for the embodiment of the present invention, and the method includes the following steps:

[0048] The first step includes the following two parts:

[0049] (1) Intercept video image blocks adapted to neural network input from video frames

[0050] Specifically, for a certain reference video frame of each provided reference video, it is necessary t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a neural network-based video and audio joint quality evaluation method, comprising: from the video frame, the video image block adapted to the input of the neural network is intercepted, and the one-dimensional audio signal of the audio segment is converted by short-time Fourier transform For the two-dimensional spectrogram representation, the neural network is used to extract the perceptual quality features from the video image block and the two-dimensional spectrogram respectively, and the extracted audio and video deep neural network perceptual quality features are post-processed to obtain two modes based on The quality features of the deep neural network, the quality features of the two modalities are fused to obtain the joint perceptual quality of video frames and audio clips, and the joint perceptual quality of video frames and audio clips is pooled in the time domain to obtain the joint perceptual quality of the overall audio and video. At the same time, a combined quality evaluation device is provided. The neural network-based video and audio joint quality evaluation method provided by the present invention can effectively evaluate the overall experience quality of audio and video.

Description

technical field [0001] The present invention relates to the technical field of multimedia quality evaluation, in particular to a neural network-based video and audio joint quality evaluation method and device. Background technique [0002] With the progress of society and the development of science and technology, the way people convey information is constantly changing. In particular, the rapid development of information technology has made multimedia represented by video and audio gradually become an indispensable way for people to convey information and communicate. Statistics show that people around the world take more than one trillion photos every year, and other types of multimedia information such as audio and video are also experiencing explosive growth. In this context, related multimedia signal processing technology has also become a research hotspot. Multimedia information may go through various stages such as collection, compression, transmission, processing, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04N17/00H04N21/234H04N21/233H04N21/44H04N21/439H04N21/475G06N3/04
CPCH04N17/00H04N21/23418H04N21/233H04N21/44008H04N21/4394H04N21/4756G06N3/045
Inventor 闵雄阔翟广涛杨小康
Owner SHANGHAI JIAOTONG UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More