Metadata based on binaural audio packet format and generation method, device and medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
An audio package and metadata technology, applied in speech analysis, stereo communication headsets, instruments, etc., can solve the problem that the two-channel speaker system cannot achieve the effect, and achieve the effect of improving the quality

Pending Publication Date: 2022-03-18

赛因芯微(北京)电子科技有限公司

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

For example, a two-channel speaker system cannot achieve the effect of a surround 5.1 speaker system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0037] The present disclosure provides metadata in an audio package format in a three-dimensional audio model, and describes it in detail.

[0038] In the audio packet format element of the 3D audio production model, the metadata of the audio object and the audio stream data are divided into multiple data blocks according to channels, and these data blocks are called audio packets. These audio packets travel along different paths across one or more networks to be reassembled at their destination. The embodiment of the present disclosure uses the metadata 100 of the audio packet format to describe the structural information of the audio packet format.

[0039] Such as figure 2 As shown, the metadata 100 in the audio packet format includes an attribute area 110 and a sub-element area 120 .

[0040] The attribute area 110 includes an audio packet format identifier 111 and an audio packet format name 112 of the audio packet.

[0041] The audio packet format identifier 111 incl...

Embodiment 2

[0068] The present disclosure also provides a method embodiment inherited from the above embodiment, a method for generating metadata in an audio package format, the explanation based on the same name meaning is the same as the above embodiment, and has the same technical effect as the above embodiment, I won't repeat them here.

[0069] Such as image 3 As shown, a method for generating metadata in an audio package format includes the following steps:

[0070] Step S210, generating metadata in audio packet format, the metadata in audio packet format includes:

[0071] The attribute area includes the audio packet format identifier and the audio packet format name of the audio packet, and the audio packet format identifier includes information indicating that the audio type of the audio packet is a binaural channel type;

[0072] Sub-element area, including: first reference information, second reference information and absolute distance, the first reference information includ...

Embodiment 3

[0079] Figure 4 A schematic structural diagram of an electronic device provided by Embodiment 3 of the present disclosure. Such as Figure 4 As shown, the electronic device includes: a processor 30 , a memory 31 , an input device 32 and an output device 33 . The number of processors 30 in the electronic device can be one or more, Figure 4 A processor 30 is taken as an example. The number of memory 31 in the electronic device can be one or more, Figure 4 Take a memory 31 as an example. The processor 30, the memory 31, the input device 32 and the output device 33 of the electronic device can be connected by bus or other methods, Figure 4 Take connection via bus as an example. The electronic device may be a computer, a server, and the like. Embodiments of the present disclosure are described in detail by using an electronic device as a server, and the server may be an independent server or a cluster server.

[0080]As a computer-readable storage medium, the memory 31 ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a binaural audio packet format-based metadata and a generation method, equipment and a medium. The metadata of the audio packet format comprises an attribute area which comprises an audio packet format identifier and an audio packet format name of an audio packet, and the audio packet format identifier comprises information indicating that the audio type of the audio packet is a binaural channel type; the sub-element area comprises first reference information, second reference information and an absolute distance, the first reference information comprises audio channel format information adopted by an audio channel related to the audio packet during rendering, and the second reference information is indicated as preset invalid information; the preset invalid information is used for representing that the corresponding reference information does not exist in the audio packet of the binaural channel type during rendering, the absolute distance indication is a preset invalid value, and the preset invalid value is used for representing that the corresponding distance does not exist in the audio packet of the binaural channel type during rendering. And during rendering, reproduction of the three-dimensional sound can be realized in the binaural, so that the quality of a sound scene is improved.

Description

technical field [0001] The present disclosure relates to the technical field of audio processing, and in particular to a binaural audio packet-based metadata and generation method, device and medium. Background technique [0002] As technology develops, audio becomes more and more complex. From the early monophonic audio to stereo, the focus of work is also on the correct processing of the left and right channels. But with the advent of surround sound, the process started to get complicated. The surround 5.1 speaker system sorts and constrains multiple channels, and then the surround 6.1 speaker system, surround 7.1 speaker system, etc. make the audio processing ever-changing, and transmit the correct signal to the appropriate speaker to form an interrelated effect. Therefore, as sound becomes more immersive and interactive, the complexity of audio processing increases significantly. [0003] An audio channel (or channel) refers to mutually independent audio signals that ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L19/008H04R5/033

CPCG10L19/008H04R5/033

Inventor吴健

Owner赛因芯微(北京)电子科技有限公司

Metadata based on binaural audio packet format and generation method, device and medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology