An Audio Object Coding Method Adapted to Personalized Interactive System

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
An interactive system and object coding technology, which is applied in speech analysis, instruments, etc., can solve the problems of unguaranteed audio object decoding sound quality, aliasing and distortion, etc., and achieve the effect of reducing bit rate, good sound quality, and meeting user needs

Active Publication Date: 2022-02-01

WUHAN UNIV

View PDF16 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, these methods can only improve the listening experience of a certain target object, other objects still have the problem of aliasing and distortion, and cannot guarantee that each audio object has a better decoding sound quality

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0021] In order to facilitate those skilled in the art to understand and implement the present invention, the technical solution of the present invention will be further described below in conjunction with the accompanying drawings and specific implementation examples. It should be understood that the implementation examples described here are only for illustration and explanation of the present invention, and are not intended to To limit the present invention:

[0022] The present invention conducts further research on the basis of the existing audio object coding method, and proposes a multi-step downmixing and reconstruction audio object coding and decoding method. First, according to the frequency domain energy of the object, the optimal coding sequence is studied, and the object that needs to be coded and calculated side information at each step is determined, and finally the residual information of each object can be obtained, which can effectively reduce the signal disto...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an audio object encoding method suitable for a personalized interactive system. In the encoding stage, the invention first transforms multiple audio objects to be encoded from the time domain into the frequency domain by frame division and windowing; according to each object Sort the energy of each step to determine the coding order of the objects; extract the coding objects and the corresponding downmix signals of each step in a loop, and calculate the parameters and residuals of each step accordingly; use singular value decomposition to decompose and compress the large-sized residual matrix; Combining the final mixed signal, parameter and residual decomposition matrix into a code stream. In the decoding stage, the decomposition matrix is used to reconstruct the residual; then, according to the residual and parameters of each object, the objects are decoded and reconstructed from the downmix signal step by step. The invention can simultaneously ensure low code rate and high quality reconstruction of each audio object through sequential multi-step codec and residual decomposition.

Description

technical field [0001] The invention belongs to the technical field of digital audio signal processing, and in particular relates to a multi-step step-by-step downmixing and reconstruction audio object encoding and decoding method, which is suitable for a personalized interactive system of spatial audio and allows users to adjust audio objects according to their own needs. Background technique [0002] The spatial audio technology based on channel coding can realize the coding and reconstruction of three-bit audio scenes, which can provide more immersive listening experience than mono or stereo audio technology, such as MPEG spatial audio coding, NHK22.2 speaker array, etc. , and thus become more and more popular. However, the traditional channel-based spatial audio system still has limitations, and its flexibility is low, which is difficult to meet the audio service system that supports personalized interactive functions. Therefore, the new generation of audio coding techn...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L19/008G10L19/02

CPCG10L19/008G10L19/02

Inventor胡瑞敏胡晨昊王晓晨武庭照吴玉林

OwnerWUHAN UNIV

An Audio Object Coding Method Adapted to Personalized Interactive System

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology