1. Introduction
A typical multi-view/3D sequence consists of multiple videos captured simultaneously from the same scene by several cameras that are located at different positions. Multiview video coding is one of the latest amendments to the H.264/AVC standard [1] that allows us to compress multi-view/3D video streams more efficiently. But even with the improvement of MVC, multi-view/3D video is currently beyond the capabilities of most mobile devices and environments. One approach to reduce the amount of data is the scalability of the coded bitstream, which is particularly useful for the transmission of multi-view/3D video in heterogeneous environments where receivers have different bandwidth, display size, and processing power. Scalable Video Coding (SVC), in single view video, enables the decoding of partial bitstreams (called layers) to provide video sequences with lower temporal/spatial resolutions or reduced quality. Typically, a “Base Layer” (BL) is built which carries the minimum amount of video information that is necessary. One or more “Enhancement Layers” (EL) can then be built to increase the quality on top of the BL.