发明名称 VIDEO CONCEPT DETECTION USING MULTI-LAYER MULTI-INSTANCE LEARNING
摘要 Visual concepts contained within a video clip are classified based upon a set of target concepts. The clip is segmented into shots and a multi-layer multi-instance (MLMI) structured metadata representation of each shot is constructed. A set of pre-generated trained models of the target concepts is validated using a set of training shots. An MLMI kernel is recursively generated which models the MLMI structured metadata representation of each shot by comparing prescribed pairs of shots. The MLMI kernel is subsequently utilized to generate a learned objective decision function which learns a classifier for determining if a particular shot (that is not in the set of training shots) contains instances of the target concepts. A regularization framework can also be utilized in conjunction with the MLMI kernel to generate modified learned objective decision functions. The regularization framework introduces explicit constraints which serve to maximize the precision of the classifier.
申请公布号 US2009274434(A1) 申请公布日期 2009.11.05
申请号 US20080111202 申请日期 2008.04.29
申请人 MICROSOFT CORPORATION 发明人 MEI TAO;HUA XIAN-SHENG;LI SHIPENG;GU ZHIWEI
分类号 G11B27/00 主分类号 G11B27/00
代理机构 代理人
主权项
地址