Title of invention |
Systems and methods for facilitating identification of and interaction with objects in a video or image frame |
Abstract |
Systems and methods for facilitating identification of and interaction with objects in a video frame are provided. In some embodiments, a system can include a computer-readable storage medium encoding computer executable components, and a processor that executes computer executable components encoded within the computer-readable storage medium. The components can include: a communication component that receives a video; a segmentation component that obtains a frame from the video; and a selection component that determines an object selected within the frame. The selection component can include a classifier trained using a probability map stored in the memory. The probability map can include information indicative of a likelihood that a pixel in the frame corresponds to the object, and can be generated based on crowdsourcing object differentiation. |
Publication number |
US8837819(B1) |
Publication date |
2014.09.16 |
Application number |
US201213440901 |
Filing date |
2012.04.05 |
Applicant |
Google Inc. |
Inventors |
Lees Jennie;Huang Jonathan |
Classification |
G06K9/00 |
Main classification |
G06K9/00 |
Agency |
Amin, Turocy & Watson, LLP |
Agent |
Amin, Turocy & Watson, LLP |
Independent claim |
1. A system, comprising:
a memory storing computer executable components; and
a processor configured to execute the following computer executable components stored in the memory:
a communication component that receives a video;
a segmentation component that obtains a frame from the video; and
a selection component that determines an object selected within the frame and comprises a classifier trained using a probability map stored in the memory, wherein the probability map comprises information indicative of a likelihood that a pixel in the frame corresponds to the object, and is generated based, at least, on:
information indicative of masking of a plurality of objects with one or more colors, the masking generated by a plurality of users and the plurality of objects being included in a scene of the frame;
differentiation of the plurality of objects based, at least, on a level of commonality between the masking of the plurality of objects; and
selection of information associated with masked regions of the frame wherein the level of commonality between the masking is greater than or substantially equal to a defined threshold. |
Address |
Mountain View CA US |
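
The probability-map generation recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation, and it rests on several assumptions that do not appear in the record: each user's masking is a grid of color labels (None where the user left a pixel unmasked), all users follow the same color-to-object convention, the "level of commonality" is taken to be the fraction of users who assigned a given color to a pixel, and "greater than or substantially equal to" is simplified to a plain >= comparison. The function names `probability_map` and `select_regions` are hypothetical.

```python
from typing import List, Optional

# One user's masking of a frame: a color label per pixel, None if unmasked.
Mask = List[List[Optional[str]]]


def probability_map(masks: List[Mask], color: str) -> List[List[float]]:
    """Per-pixel fraction of users who masked the pixel with `color`.

    Assumption: this agreement fraction stands in for the claim's
    "level of commonality" and for the likelihood that the pixel
    belongs to the object painted in `color`.
    """
    n_users = len(masks)
    rows, cols = len(masks[0]), len(masks[0][0])
    return [
        [sum(m[r][c] == color for m in masks) / n_users for c in range(cols)]
        for r in range(rows)
    ]


def select_regions(prob: List[List[float]], threshold: float) -> List[List[bool]]:
    """Keep only pixels whose agreement meets the defined threshold."""
    return [[p >= threshold for p in row] for row in prob]


# Three hypothetical users mask a 2x3 frame containing a "red" and a "blue" object.
user_masks: List[Mask] = [
    [["red", "red", None], [None, "blue", "blue"]],
    [["red", None, None], [None, "blue", "blue"]],
    [["red", "red", None], ["blue", "blue", None]],
]

red_prob = probability_map(user_masks, "red")
blue_prob = probability_map(user_masks, "blue")
red_selected = select_regions(red_prob, threshold=0.6)
blue_selected = select_regions(blue_prob, threshold=0.6)
```

In this sketch, a pixel that only one of the three users painted blue falls below the 0.6 threshold and is excluded, while pixels that two or three users agree on are retained. The retained regions, together with the per-pixel fractions, are the kind of information a classifier could then be trained on.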