Digital video consumes large amounts of storage and transmission capacity.
Many computers and computer networks lack the resources to process raw
digital video.
Compression can be lossless, in which the quality of the video does not suffer, but decreases in
bit rate are limited by the inherent amount of variability (sometimes called entropy) of the video data.
Or, compression can be lossy, in which the quality of the video suffers, but achievable decreases in
bit rate are more dramatic.
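The limit that entropy places on lossless compression can be illustrated with a small calculation. The sketch below is an illustration only, not part of the described techniques: it computes the zeroth-order Shannon entropy of a list of sample values, which ignores the spatial and temporal correlations that real video codecs exploit. The function name `shannon_entropy_bits` and the sample data are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy_bits(samples):
    """Return the zeroth-order Shannon entropy in bits per sample.

    Under a model that treats samples as independent, this is a lower
    bound on the average number of bits per sample that a lossless
    coder can achieve.
    """
    counts = Counter(samples)
    total = len(samples)
    # Sum p * log2(1/p) over every distinct sample value.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


# Hypothetical data: a perfectly flat region and a highly varied region.
flat = [128] * 16          # every sample identical, so no variability
varied = list(range(16))   # every sample distinct, so maximum variability

flat_entropy = shannon_entropy_bits(flat)
varied_entropy = shannon_entropy_bits(varied)

print(f"flat region:   {flat_entropy} bits/sample")
print(f"varied region: {varied_entropy} bits/sample")
```

The flat region needs essentially no bits per sample, while the varied region cannot be coded losslessly below four bits per sample under this simple model. Lossy compression lowers the bit rate further by discarding some of that variability.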
In certain encoding / decoding scenarios (e.g., high
bit rate interlaced video with lots of motion), limiting motion compensation for forward prediction to be relative to a single reference can hurt overall compression efficiency.
Problems with Previous Signaling of Macroblock Information
While this signaling provides good overall performance in many cases, it does not adequately
exploit statistical dependencies between different signaled information in various common cases.
Further, it does not allow for or address various useful configurations, such as the presence or absence of CBPCY for 4MV macroblocks, or the presence or absence of motion vector data for 1MV macroblocks.
Nor does the signaling allow for or address various useful configurations such as the presence of coefficient information when motion vector data is absent.
The reference field selection signals for the motion vectors can consume a lot of bits.
The signaling of reference field selection information is inefficient in terms of pure coding efficiency.
Moreover, in some scenarios, however the information is encoded, the reference field selection information may consume so many bits that it outweighs the prediction improvements gained from having multiple references available in motion compensation.
However, the processes of managing reference picture lists and signaling reference picture selections are complex and consume an inefficient number of bits in some scenarios.
Finally, the standards are inflexible in their decisions of which code tables to use for
macroblock mode information.
Determining a motion vector predictor from a single neighbor suffices when motion is uniform, but is inefficient in many other cases.
Approaches that derive the predictor from multiple candidate motion vector predictors are efficient for more kinds of motion, but still do not adequately address scenarios in which there is a high degree of variance between the different candidates, indicating discontinuity in motion patterns.
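The contrast between single-neighbor and multi-candidate prediction can be sketched as follows. The text does not specify how the candidates are combined; the component-wise median of three neighboring motion vectors used here is an assumption, chosen because it is a common approach in video codecs. The function names, the spread measure, and the sample vectors are all hypothetical.

```python
from statistics import median


def single_neighbor_predictor(left):
    """Use one neighbor's motion vector as the predictor (assumed: left)."""
    return left


def median_predictor(candidates):
    """Component-wise median of the candidate motion vectors (an assumption)."""
    xs = [mv[0] for mv in candidates]
    ys = [mv[1] for mv in candidates]
    return (median(xs), median(ys))


def candidate_spread(candidates):
    """Largest component-wise range among the candidates.

    A hypothetical measure of how strongly the candidates disagree.
    """
    xs = [mv[0] for mv in candidates]
    ys = [mv[1] for mv in candidates]
    return max(max(xs) - min(xs), max(ys) - min(ys))


# Hypothetical neighbor motion vectors as (x, y) pairs.
uniform = [(4, 0), (4, 0), (5, 0)]             # neighbors move together
discontinuous = [(4, 0), (-12, 8), (20, -6)]   # neighbors disagree strongly

for name, candidates in (("uniform", uniform), ("discontinuous", discontinuous)):
    print(name,
          "single:", single_neighbor_predictor(candidates[0]),
          "median:", median_predictor(candidates),
          "spread:", candidate_spread(candidates))
```

In this sketch both neighborhoods yield the same median predictor, yet the second has a far larger spread among its candidates. The median alone says nothing about how reliable the prediction is when motion is discontinuous, which is the shortcoming described above.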
While JVT-D157 provides some flexibility in signaling weighted prediction parameters, the signaling mechanism is inefficient in various scenarios.
Whatever the benefits of previous video compression and decompression techniques, however, they do not have the advantages of the following techniques and tools.