A process and apparatus for digital compression of multiview video, supplied by additional data of scene depth. The method of coding is offered, including: each frame of the multiview video sequence, encoded again, determined according to the predefined order of coding, is represented as a collection of non-overlapped blocks, such that at least one already encoded frame is detected, corresponding to the given view and designated as reference, the synthesized frames for encoded and reference frames, differing that for each non-overlapped block of pixels of the encoded frame designated as an encoded block the spatial-combined block in the synthesized frame is determined, corresponding to the encoded frame, designated as the virtual block, for which spatial position of the block of pixels is determined in the synthesized frame corresponding to a reference frame.