Disclosed are scalable quantizers for audio and other signals characterized by a non-uniform,
perception-based
distortion metric, that operate in a common companded domain which includes both the base-layer and one or more enhancement-
layers. The common companded domain is designed to permit use of the same unweighted MSE metric for optimal quantization parameter selection in multiple
layers, exploiting the statistical dependence of the enhancement-layer
signal on the quantization parameters used in the preceding layer. One embodiment features an asymptotically optimal entropy coded uniform scalar quantizer. Another embodiment is an improved
bit rate scalable multi-layer Advanced Audio Coder (AAC) which extends the
scalability of the asymptotically optimal entropy coded uniform scalar quantizer to systems with non-uniform base-layer quantization, selecting the enhancement-layer quantization methodology to be used in a particular band based on the preceding layer quantization coefficients. In the important case that the source is well modeled as Laplacian, the optimal conditional quantizer is implementable by only two distinct switchable quantizers depending on whether or not the previous quantizer identified the band in question as a so-called “zero dead-zone:” Hence, major savings in
bit rate are recouped at virtually no additional computational cost. For example, the proposed four layer scalable coder consisting of 16 kbps
layers achieves performance close to a 60 kbps non-scalable coder on the
standard test database of 44.1 kHz audio.