Unlock instant, AI-driven research and patent intelligence for your innovation.

Audio object clustering by utilizing temporal variations of audio objects

A time-varying, object technology, applied in speech analysis, stereo systems, instruments, etc., can solve the problems of low inter-frame stability and auditory defects in the clustering process, and achieve the effect of improving allocation stability and avoiding defects.

Active Publication Date: 2015-09-02
DOLBY LAB LICENSING CORP
View PDF17 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

As a result, the inter-frame stability of the clustering process is relatively low, which is likely to cause auditory artifacts when rendering audio object classes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio object clustering by utilizing temporal variations of audio objects
  • Audio object clustering by utilizing temporal variations of audio objects
  • Audio object clustering by utilizing temporal variations of audio objects

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] The principles of the invention will be described below with reference to several example embodiments shown in the accompanying drawings. It should be understood that these embodiments are described only to enable those skilled in the art to better understand and implement the present invention, but not to limit the scope of the present invention in any way.

[0018] As mentioned above, in known audio object clustering schemes, the assignment of objects to classes is sometimes unstable. Stable assignment here means that audio objects (at least for those static objects) are consistently assigned to cluster centers with the same location. For audio objects with fixed locations, the assignment of objects to classes is usually determined by the location of the chosen cluster centers. If the location of the center is relatively stable, the assignment of objects to classes will also be relatively stable. On the contrary, if the cluster centers move or even jump from one loc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.

Description

technical field [0001] The present invention relates generally to audio object clustering, and more particularly to methods and systems for using temporal variation of audio objects in audio object clustering. Background technique [0002] Traditionally, audio content is created and stored in a channel based format. The term "audio channel" or "channel" as used herein refers to audio content, usually having a predefined physical location. For example, stereo, surround 5.1, surround 7.1, etc. are all channel-based formats for audio content. Recently, many traditional multi-channel systems have been extended to support a new format that includes both channels and audio objects. The term "audio object" or simply "object" as used herein refers to an individual audio element that exists in a sound field for a certain duration. An audio object can be dynamic or static. For example, an audio object may be a person, an animal, or any other element capable of acting as a sound so...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L19/022H04S7/00
CPCG10L19/20H04S7/30G10L25/48G10L25/03G10L19/022G10L25/21
Inventor 陈连武芦烈J·布里巴特
Owner DOLBY LAB LICENSING CORP
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More