Title of invention: Systems and methods for facilitating identification of and interaction with objects in a video or image frame
Abstract: Systems and methods for facilitating identification of and interaction with objects in a video frame are provided. In some embodiments, a system can include a computer-readable storage medium encoding computer executable components, and a processor that executes computer executable components encoded within the computer-readable storage medium. The components can include: a communication component that receives a video; a segmentation component that obtains a frame from the video; and a selection component that determines an object selected within the frame. The selection component can include a classifier trained using a probability map stored in the memory. The probability map can include information indicative of a likelihood that a pixel in the frame corresponds to the object, and can be generated based on crowdsourcing object differentiation.
Publication number: US8837819(B1) Publication date: 2014.09.16
Application number: US201213440901 Filing date: 2012.04.05
Applicant: Google Inc. Inventors: Lees Jennie; Huang Jonathan
Classification: G06K9/00 Main classification: G06K9/00
Agency: Amin, Turocy & Watson, LLP Agent: Amin, Turocy & Watson, LLP
Principal claim: 1. A system, comprising: a memory storing computer executable components; and a processor configured to execute the following computer executable components stored in the memory: a communication component that receives a video; a segmentation component that obtains a frame from the video; and a selection component that determines an object selected within the frame and comprises a classifier trained using a probability map stored in the memory, wherein the probability map comprises information indicative of a likelihood that a pixel in the frame corresponds to the object, and is generated based, at least, on: information indicative of masking of a plurality of objects with one or more colors, the masking generated by a plurality of users and the plurality of objects being included in a scene of the frame; differentiation of the plurality of objects based, at least, on a level of commonality between the masking of the plurality of objects; and selection of information associated with masked regions of the frame wherein the level of commonality between the masking is greater than or substantially equal to a defined threshold.
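Illustrative sketch (not part of the patent record): the claim describes a probability map built from masks drawn by several users, with regions kept only where the masks agree to at least a defined threshold. The Python below is one possible reading for a single object, under stated assumptions: each user's mask is a binary grid, the "level of commonality" is taken to be the fraction of users who masked a pixel, and the names `probability_map`, `select_regions`, and the threshold value 0.6 are hypothetical. The claim itself does not specify any of these details.

```python
def probability_map(user_masks):
    """Return the per-pixel fraction of users who masked each pixel.

    Assumption: each mask is a list of rows of 0/1 values, all masks
    share the same dimensions, and agreement between users stands in
    for the claim's "level of commonality".
    """
    n_users = len(user_masks)
    rows = len(user_masks[0])
    cols = len(user_masks[0][0])
    return [
        [sum(mask[r][c] for mask in user_masks) / n_users for c in range(cols)]
        for r in range(rows)
    ]


def select_regions(prob_map, threshold):
    """Keep pixels whose agreement meets or exceeds the threshold."""
    return [[p >= threshold for p in row] for row in prob_map]


# Three hypothetical users mask the same object in a 2x3 frame.
masks = [
    [[1, 1, 0], [0, 1, 0]],
    [[1, 1, 0], [0, 0, 0]],
    [[1, 0, 0], [0, 1, 1]],
]

prob = probability_map(masks)
selected = select_regions(prob, threshold=0.6)  # threshold is an assumed value

for row in selected:
    print(row)
```

In this toy frame, the pixel masked by only one of the three users falls below the assumed threshold and is dropped, while pixels masked by two or three users are retained. A fuller treatment would track one map per object or mask color, which this sketch omits.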
Address: Mountain View CA US