Title of Invention: SYSTEMS AND METHODS FOR CREATING AND USING NAVIGABLE SPATIAL OVERVIEWS FOR VIDEO
Abstract: Systems and methods for generating an overview for videos by reconstructing a representation of underlying content and linking from points in the overview to specific points in the video. Mechanisms are provided to create three different types of navigable overviews for different types of how-to and instructional videos. A two-dimensional overview is generated when the content is two-dimensional, such as instructional videos on an electronic whiteboard or other flat content. A three-dimensional overview is created when the content is three-dimensional, such as how-to videos illustrating the use of specific three-dimensional tangible articles. In the three-dimensional case, when a 3D model is available, the video segments are directly linked to corresponding points on the model. When a model is not available, a rough overview is first created from the captured video and camera orientation metadata. When the user selects a specific location within the overview, the related video segment is automatically played to the user.
Publication No.: US2014245151(A1) Publication Date: 2014.08.28
Application No.: US201313775116 Filing Date: 2013.02.22
Applicant: FUJI XEROX CO., LTD. Inventors: Carter Scott; Cooper Matthew L.; Adcock John; Branham Stacy
Classification: G06F3/0484 Main Classification: G06F3/0484
Agency: Agent:
Main Claim: 1. A computer-implemented method performed in a computerized system comprising a central processing unit, a display device and a memory, the computer-implemented method performed in connection with a video of an article, the computer-implemented method comprising: a. using the central processing unit to segment the video based at least on time and a camera orientation metadata into a plurality of video segments; b. obtaining a plurality of images corresponding to a plurality of sides of the article; c. using the central processing unit to map each of the plurality of images to at least one of the plurality of video segments; d. generating a graphical user interface on the display device, the graphical user interface displaying at least one of the plurality of images based on user selection; and e. playing the at least one of the plurality of video segments mapped to the displayed one of the plurality of images.
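Steps a, c and e of claim 1 can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the implementation disclosed in the application. The assumptions are that the camera orientation metadata is reduced to a single yaw angle per frame, that each side image of the article has a nominal yaw, and that a segment boundary is placed where the yaw drifts past a threshold or a time limit is reached. All names (`Frame`, `Segment`, `segment_video`, `map_sides_to_segments`, `yaw_threshold`, `max_duration`) are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Frame:
    t: float    # timestamp in seconds
    yaw: float  # assumed camera orientation metadata, in degrees


@dataclass
class Segment:
    start: float  # start time in seconds
    end: float    # end time in seconds
    yaw: float    # yaw of the first frame in the segment


def angular_diff(a: float, b: float) -> float:
    """Smallest absolute difference between two angles, in degrees."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)


def segment_video(frames: List[Frame],
                  yaw_threshold: float = 45.0,
                  max_duration: float = 30.0) -> List[Segment]:
    """Step a: split on orientation change or elapsed time (assumed rule)."""
    segments: List[Segment] = []
    if not frames:
        return segments
    start = frames[0]
    for f in frames[1:]:
        turned = angular_diff(f.yaw, start.yaw) > yaw_threshold
        too_long = f.t - start.t > max_duration
        if turned or too_long:
            segments.append(Segment(start.t, f.t, start.yaw))
            start = f
    # Close the final segment at the last frame's timestamp.
    segments.append(Segment(start.t, frames[-1].t, start.yaw))
    return segments


def map_sides_to_segments(side_yaws: Dict[str, float],
                          segments: List[Segment]) -> Dict[str, List[Segment]]:
    """Step c: attach each segment to the side image with the nearest yaw."""
    mapping: Dict[str, List[Segment]] = {side: [] for side in side_yaws}
    for seg in segments:
        nearest = min(side_yaws, key=lambda s: angular_diff(side_yaws[s], seg.yaw))
        mapping[nearest].append(seg)
    return mapping


def playback_ranges(mapping: Dict[str, List[Segment]], side: str):
    """Step e: time ranges to play when the user displays a given side."""
    return [(seg.start, seg.end) for seg in mapping.get(side, [])]
```

In this sketch a side image that no segment faces ends up with an empty list, whereas the claim maps every image to at least one segment; a fuller version would need a fallback such as assigning the nearest segment to each unmatched side.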
Address: Tokyo JP