摘要 |
In order to assist a viewer in understanding the contents of a moving image, it is desired to plainly represent the contents of metadata to the viewer. The metadata relevant to the moving image has a stream data structure including one or more access units each being a data unit which can be independently processed. Each of the access units includes first data to specify an effective period defined with respect to a time axis of the moving image, object area data describing a spatio-temporal region in the moving image, and balloon data to display relevant data of the object by a balloon. The balloon data includes text data to be displayed in the inside of the balloon and to express the contents of the object and position specifying data to specify a display position of the balloon.
|