Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

An Audio Object Coding Method Adapted to Personalized Interactive System

An interactive system and object coding technology, which is applied in speech analysis, instruments, etc., can solve the problems of unguaranteed audio object decoding sound quality, aliasing and distortion, etc., and achieve the effect of reducing bit rate, good sound quality, and meeting user needs

Active Publication Date: 2022-02-01
WUHAN UNIV
View PDF16 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these methods can only improve the listening experience of a certain target object, other objects still have the problem of aliasing and distortion, and cannot guarantee that each audio object has a better decoding sound quality

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Audio Object Coding Method Adapted to Personalized Interactive System
  • An Audio Object Coding Method Adapted to Personalized Interactive System
  • An Audio Object Coding Method Adapted to Personalized Interactive System

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] In order to facilitate those skilled in the art to understand and implement the present invention, the technical solution of the present invention will be further described below in conjunction with the accompanying drawings and specific implementation examples. It should be understood that the implementation examples described here are only for illustration and explanation of the present invention, and are not intended to To limit the present invention:

[0022] The present invention conducts further research on the basis of the existing audio object coding method, and proposes a multi-step downmixing and reconstruction audio object coding and decoding method. First, according to the frequency domain energy of the object, the optimal coding sequence is studied, and the object that needs to be coded and calculated side information at each step is determined, and finally the residual information of each object can be obtained, which can effectively reduce the signal disto...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an audio object encoding method suitable for a personalized interactive system. In the encoding stage, the invention first transforms multiple audio objects to be encoded from the time domain into the frequency domain by frame division and windowing; according to each object Sort the energy of each step to determine the coding order of the objects; extract the coding objects and the corresponding downmix signals of each step in a loop, and calculate the parameters and residuals of each step accordingly; use singular value decomposition to decompose and compress the large-sized residual matrix; Combining the final mixed signal, parameter and residual decomposition matrix into a code stream. In the decoding stage, the decomposition matrix is ​​used to reconstruct the residual; then, according to the residual and parameters of each object, the objects are decoded and reconstructed from the downmix signal step by step. The invention can simultaneously ensure low code rate and high quality reconstruction of each audio object through sequential multi-step codec and residual decomposition.

Description

technical field [0001] The invention belongs to the technical field of digital audio signal processing, and in particular relates to a multi-step step-by-step downmixing and reconstruction audio object encoding and decoding method, which is suitable for a personalized interactive system of spatial audio and allows users to adjust audio objects according to their own needs. Background technique [0002] The spatial audio technology based on channel coding can realize the coding and reconstruction of three-bit audio scenes, which can provide more immersive listening experience than mono or stereo audio technology, such as MPEG spatial audio coding, NHK22.2 speaker array, etc. , and thus become more and more popular. However, the traditional channel-based spatial audio system still has limitations, and its flexibility is low, which is difficult to meet the audio service system that supports personalized interactive functions. Therefore, the new generation of audio coding techn...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L19/008G10L19/02
CPCG10L19/008G10L19/02
Inventor 胡瑞敏胡晨昊王晓晨武庭照吴玉林
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products