发明名称 |
Scanning and tracking dynamic objects with depth cameras |
摘要 |
A video conference server receives a plurality of video frames including a current frame and at least one previous frame. Each of the video frames includes a corresponding image and a corresponding depth map. The server produces a directional distance function (DDF) field that represents an area surrounding a target surface of the object captured in the current frame. A forward transformation is generated that modifies the reference surface to align with the target surface. Using at least a portion of the forward transformation, a backward transformation is calculated that modifies the target surface of the current frame to align with the reference surface. The backward transformation is then applied to the DDF to generate a transformed DDF. The server updates the reference model with the transformed DDF and transmits data for the updated reference model to enable a representation of the object to be produced at a remote location. |
申请公布号 |
US9299195(B2) |
申请公布日期 |
2016.03.29 |
申请号 |
US201414224575 |
申请日期 |
2014.03.25 |
申请人 |
Cisco Technology, Inc. |
发明人 |
Dou Mingsong;Fuchs Henry;Marathe Madhav |
分类号 |
G06T17/00;G06T19/20;G06T7/00;H04N7/15 |
主分类号 |
G06T17/00 |
代理机构 |
Edell, Shapiro & Finnan, LLC |
代理人 |
Edell, Shapiro & Finnan, LLC |
主权项 |
1. A method comprising:
receiving a plurality of video frames comprising a current frame and at least one previous frame, wherein each of the plurality of video frames includes data for a corresponding image and data for a corresponding depth map; receiving a reference model comprising data from the at least one previous frame, the reference model including data representing a reference surface corresponding to at least one dynamic object captured in the plurality of video frames; processing the data representing the depth map of the current frame to produce data for a directional distance function (DDF) field that represents an area surrounding a target surface of the at least one dynamic object captured in the current frame; generating a forward transformation that modifies the data representing the reference surface to align with data representing the target surface and generate data for a modified reference surface; calculating a backward transformation that modifies data representing the target surface of the current frame to align with the reference surface, wherein the backward transformation uses at least a portion of the forward transformation; generating a transformed DDF by applying the backward transformation to the DDF; fusing the transformed DDF into a multi-mode distance function (MDDF) comprising multiple distance values and direction vectors at each of a plurality of voxels; and updating the reference model based on the MDDF to produce data for an updated reference model. |
地址 |
San Jose CA US |