摘要 |
From a plurality of images acquired by capturing images from a plurality of viewpoints, a first image having a region at a first in-focus depth, a second image having a region at a second in-focus depth, which is different from the first in-focus depth, and a third image having a region at an in-focus depth between the first in-focus depth and the second in-focus depth are generated. The first image, the third image, and the second image are displayed on a display unit. A sound is generated from a sound associated with the first image and that associated with the second image. The generated sound is reproduced while the first image, the third image, and the second image are displayed on the display unit one by one. |