发明名称 Systems and methods for feature fusion
摘要 Systems and methods for generating visual words define initial inter-visual word relationships between a plurality of visual words; define visual word-image relationships between the plurality of visual words and a plurality of images; define inter-image relationships between the plurality of images; generate revised inter-visual word relationships in a vector space based on the initial inter-visual word relationships, the inter-image relationships, and the visual word-image relationships; and generate higher-level visual words in the vector space based on the revised inter-visual word relationships.
申请公布号 US9190026(B2) 申请公布日期 2015.11.17
申请号 US201313829338 申请日期 2013.03.14
申请人 Canon Kabushiki Kaisha 发明人 Yang Yang;Denney Bradley Scott;Lu Juwei;Dusberger Dariusz;Huang Hung Khei
分类号 G06T11/00;G09G5/24;G06T11/20;G06K9/62 主分类号 G06T11/00
代理机构 Canon U.S.A., Inc. IP Division 代理人 Canon U.S.A., Inc. IP Division
主权项 1. A method comprising: defining initial inter-visual word relationships between a plurality of visual words that were generated based on visual features; defining visual word-image relationships between the plurality of visual words and a plurality of images; defining inter-image relationships between the plurality of images; generating revised inter-visual word relationships in a vector space based on the initial inter-visual word relationships, the inter-image relationships, and the visual word-image relationships, wherein generating the revised inter-visual word relationships in the vector space based on the initial inter-visual word relationships, the inter-image relationships, and the visual word-image relationships includes generating a diffusion map that represents the initial inter-visual word relationships, the inter-image relationships, and the visual word-image relationships,wherein the diffusion map includes nodes,wherein the nodes represents a respective visual word or a respective image, andwherein a diffusion distance between two nodes in the diffusion map is based on a likelihood that a Markov chain transits from each of the two nodes to a same node by following any arbitrary path that has a length of a given number of transition steps; and generating higher-level visual words in the vector space based on the revised inter-visual word relationships.
地址 Tokyo JP