摘要 |
<p>Interactive multi-view video presents new types of video capture systems, video formats, video compression algorithms, and services. Many video cameras are allocated to capture an event from various related locations and directions. The captured videos are compressed and are sent to a server in real-time. The compressed video can also be transcoded through an off-line compression approach to further reduce the data amount. A key idea of off-line compression is to decompose all views into a 3D mapping, which consists of a group of feature points in the 3D environment. Each feature point is represented by its 3D coordinates (x, y, z) and the corresponding color components (Y, U, V). The created mapping is the minimum set of feature points that can reconstruct all of the pixels in each view. After the 3D mapping creation, the obtained feature points are predicted and transformed to further decompose the correlations among them. The transformed results are quantized and encoded as a ' base layer ' bit stream. The dequantized feature points are mapped back onto each view to form a predicted view image. The predicted image is close to the original one; however, there are still some differences between them. The difference is encoded independently as an ' enhancement layer ' of each view image.</p> |