发明名称 System and method for contexually interpreting image sequences
摘要 A system and method for contextually interpreting image sequences are provided. The method comprises receiving video from one or more video sources, and generating one or more questions associated with one or more portions of the video based on at least one user-defined objective. The method further comprises sending the one or more portions of the video and the one or more questions to one or more assistants, receiving one or more answers to the one or more questions from the one or more assistants, and determining a contextual interpretation of the video based on the one or more answers and the video.
申请公布号 US9355318(B2) 申请公布日期 2016.05.31
申请号 US201414522078 申请日期 2014.10.23
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Hariharan Rajaraman;Ramanathan Sri;Subbian Karthik;Trevathan Matthew B.
分类号 G06K9/00;H04L9/10;G06F12/00;H04H60/76;G06F9/45;H04H60/48;H04L12/24;H04N1/00;H04H20/65;G06F17/30 主分类号 G06K9/00
代理机构 Roberts Mlotkowski Safran & Cole, P.C. 代理人 Chung Matthew;Roberts Mlotkowski Safran & Cole, P.C.
主权项 1. A method, comprising: receiving, by a computer, video from one or more video sources; generating, by the computer, one or more questions associated with one or more portions of the video; sending, by the computer, the one or more portions of the video and the one or more questions to one or more assistants; receiving, by the computer, one or more answers to the one or more questions from the one or more assistants; and determining, by the computer, a contextual interpretation of the video based on the one or more answers and the video, wherein: the video comprises recorded image sequences recorded by a surveillance system; the one or more assistants are one or more persons; and the determining the contextual interpretation by the computer uses interpretations of both the one or more persons and the computer; further comprising identifying, by the computer, the one or more portions of the video for which the determining the contextual interpretation requires assistance based on at least one user-defined objective for the contextual interpretation; wherein the at least one user-defined objective comprises identifying, based on one or more criterion defined by the user, one of: (i) one or more items possessed by a person in the video, and (ii) a person in the video having difficulty breathing.
地址 Armonk NY US