发明名称 FRAME BY FRAME, PIXEL BY PIXEL MATCHING OF MODEL-GENERATED GRAPHICS IMAGES TO CAMERA FRAMES FOR COMPUTER VISION
摘要 There are two distinct tasks in vision or image processing. On the one hand there is the difficult task of image analysis and feature recognition, and on the other there is the less difficult task of computing the 3D world position of the camera given an input image. In biological vision, these two tasks are intertwined together such that it is difficult to distinguish one from the other. We perceive our position in world coordinates by recognizing and triangulating from features around us. It seems we can not triangulate if we don't identify first the features we triangulate from and we can't really identify unless we can place a feature somewhere in the 3D world we live in. Most, if not all, vision systems in prior art are an attempt to implement both tasks in the same system. For instance, reference U.S. Pat. No. 5,801,970 comprises both tasks; reference U.S. Pat. No. 6,704,621 seems to comprise of triangulation alone, but it actually requires recognition of the road. If the triangulation task can indeed be made separate from and independent of the analysis and feature recognition tasks, then we would need half as much computing resources in a system that does not perform the latter task. By taking advantage of current advances in graphics processing, this invention allows for triangulation of the camera position without the usual scene analysis and feature recognition. It utilizes an a priori, accurate model of the world within the field of vision. The 3D model is rendered onto a graphics surface using the latest graphics processing units. Each frame coming from the camera is then searched for a best match in a number of candidate renderings on the graphics surface. The count of rendered images to compare to is made small by computing the change in camera position and angle of view from one frame to another, and then using the results of such computations to limit the next possible positions and angles of view to render the a priori world model. The main advantage of this invention over prior art is the mapping of the real world onto a world model. One application for which this is most suited is robotic programming. A robot that is guided by an a priori map and that knows its position in that map is far more superior to one that is not so guided. It is superior with regards to navigation, homing, path finding, obstacle avoidance, aiming for point of attention, and other robotic tasks.
申请公布号 US2010283778(A1) 申请公布日期 2010.11.11
申请号 US20060993169 申请日期 2006.09.12
申请人 TAPANG CARLOS CORTES 发明人 TAPANG CARLOS CORTES
分类号 G06T15/00 主分类号 G06T15/00
代理机构 代理人
主权项
地址