Unlock instant, AI-driven research and patent intelligence for your innovation.

Video region-of-interest (ROI) encoding method based on microphone array assistance

A technology of region of interest and microphone array, which is applied in the coding field of video region of interest based on microphone array assistance, can solve the problem of not considering audio signal information, etc., and achieve the effect of good subjective viewing effect

Active Publication Date: 2015-02-25
XIAN JIAOTONG LIVERPOOL UNIV
View PDF5 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In most cases, the study of video coding schemes does not consider the information provided by the audio signal itself.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video region-of-interest (ROI) encoding method based on microphone array assistance
  • Video region-of-interest (ROI) encoding method based on microphone array assistance
  • Video region-of-interest (ROI) encoding method based on microphone array assistance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0037] A method based on microphone array-assisted video region of interest extraction and encoding is characterized in that in the method, in terms of hardware: in traditional shooting equipment, a microphone array is required, that is, the support of two or more microphones; in software Aspect: It is necessary to obtain the spatial direction of the sound through the sound direction detection algorithm, and then obtain the region of interest through the auto-focus system, or use a related algorithm to obtain the region of interest, and then encode the region of interest and non-interest region through different coding strategies .

[0038] Hardware aspect: the application of this method in smartphones, such as figure 2 shown. On this phone hardware, three microphones are required. Wherein the call microphone 1 is arranged at the lower end of the housing, and the first noise reduction microphone 2 and the second noise reduction microphone 4 are arranged at both ends of the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a video region-of-interest (ROI) encoding method based on microphone array assistance. In the process of video shooting, the space direction of sound in the shot video is acquired through a microphone array (including two or more microphones), then an automatic focusing point in the shot video is determined through the space direction, an ROI of a video frame is determined through the focusing point, and finally the video frame is encoded through different encoding strategies. Through the video ROI encoding method based on microphone array assistance, user shooting experience can be improved, the focusing point and the ROI are dynamically selected, and finally the subjective watching experience of the video is improved by redistributing code streams.

Description

technical field [0001] The invention relates to a video encoding method based on a region of interest, in particular to a method for extracting and encoding a region of interest in a video aided by a microphone array. Background technique [0002] Currently, a high-definition video format (High Definition, HD) is increasingly used in various video recording and real-time video communications. However, storing and transmitting HD video streams brings great challenges to storage devices and network bandwidth. Especially for portable recording devices, such as smart phones and DV machines, the wide use of HD video is limited due to their limited storage space. An effective solution is to divide the region of interest and the region of non-interest in the video, use different coding strategies for different regions, and use more bit rates to encode the region of interest, and vice versa. [0003] In traditional ROI-based video coding methods, most of them use face recognition,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): H04N19/167H04N5/232
CPCH04N23/67
Inventor 罗天明程飞
Owner XIAN JIAOTONG LIVERPOOL UNIV