发明名称 |
Enhanced max margin learning on multimodal data mining in a multimedia database |
摘要 |
Multimodal data mining in a multimedia database is addressed as a structured prediction problem, wherein mapping from input to the structured and interdependent output variables is learned. A system and method for multimodal data mining is provided, comprising defining a multimodal data set comprising image information; representing image information of a data object as a set of feature vectors in a feature space; clustering in the feature space to group similar features; associating a non-image representation with a respective image data object based on the clustering; determining a joint feature representation of a respective data object as a mathematical weighted combination of a set of components of the joint feature representation; optimizing a weighting for a plurality of components of the mathematical weighted combination with respect to a prediction error between a predicted classification and a training classification; and employing the mathematical weighted combination for automatically classifying a new data object.
|
申请公布号 |
US8463053(B1) |
申请公布日期 |
2013.06.11 |
申请号 |
US20090538845 |
申请日期 |
2009.08.10 |
申请人 |
GUO ZHEN;ZHANG ZHONGFEI (MARK);THE RESEARCH FOUNDATION OF STATE UNIVERSITY OF NEW YORK |
发明人 |
GUO ZHEN;ZHANG ZHONGFEI (MARK) |
分类号 |
G06K9/62 |
主分类号 |
G06K9/62 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|