Abstract |
<P>PROBLEM TO BE SOLVED: To automatically detect one or more concepts in multimedia. <P>SOLUTION: Low-level features representative of one or more concepts are extracted (310). Discriminative classifiers are trained using these low-level features (340). A collective annotation model is built based on the discriminative classifiers (380). The framework is fully generic and can be applied with any number of low-level features or discriminative classifiers. Further, the analysis makes no domain-specific assumptions and can be applied to activity analysis or other scenarios without modification. The framework admits a broad class of potential functions, thereby enabling multi-modal analysis and the fusion of heterogeneous information sources. <P>COPYRIGHT: (C)2008,JPO&INPIT |
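The three steps named in the abstract (feature extraction, classifier training, collective annotation) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the two toy features, logistic regression as the discriminative classifier, smoothed co-occurrence tables as the pairwise potential functions, exhaustive search over joint labellings, and the concept names `bright` and `textured`. The abstract itself leaves all of these choices open.

```python
import math
from itertools import combinations, product

def extract_features(frame):
    """Hypothetical low-level features: mean intensity and mean neighbour difference."""
    mean = sum(frame) / len(frame)
    edge = sum(abs(a - b) for a, b in zip(frame, frame[1:])) / (len(frame) - 1)
    return [mean, edge]

def predict_proba(model, x):
    """Probability that the concept is present, from a (weights, bias) pair."""
    w, b = model
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))

def train_classifier(X, y, lr=0.5, epochs=2000):
    """Logistic regression by batch gradient descent (stand-in discriminative classifier)."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for x, t in zip(X, y):
            err = predict_proba((w, b), x) - t
            grad_w = [g + err * xi for g, xi in zip(grad_w, x)]
            grad_b += err
        w = [wi - lr * g / len(X) for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / len(X)
    return w, b

def learn_pairwise(labels, concepts):
    """Assumed potential functions: add-one smoothed co-occurrence tables per concept pair."""
    n = len(next(iter(labels.values())))
    pairwise = {}
    for a, b in combinations(concepts, 2):
        counts = {s: 1.0 for s in product((0, 1), repeat=2)}
        for la, lb in zip(labels[a], labels[b]):
            counts[(la, lb)] += 1.0
        pairwise[(a, b)] = {s: c / (n + 4) for s, c in counts.items()}
    return pairwise

def annotate(unary, pairwise):
    """Pick the joint labelling that maximises unary plus pairwise log-potentials."""
    concepts = sorted(unary)
    best, best_score = None, -math.inf
    for states in product((0, 1), repeat=len(concepts)):
        label = dict(zip(concepts, states))
        score = 0.0
        for c in concepts:
            p = min(max(unary[c], 1e-9), 1 - 1e-9)  # clamp to avoid log(0)
            score += math.log(p if label[c] else 1 - p)
        for (a, b), table in pairwise.items():
            score += math.log(table[(label[a], label[b])])
        if score > best_score:
            best, best_score = label, score
    return best

# Toy training set: each "frame" is a short list of pixel intensities in [0, 1].
frames = [
    [0.9, 0.9, 0.9, 0.9], [0.1, 0.1, 0.1, 0.1], [1.0, 0.4, 1.0, 0.4],
    [0.6, 0.0, 0.6, 0.0], [1.0, 0.5, 1.0, 0.5], [0.2, 0.1, 0.2, 0.1],
]
labels = {"bright": [1, 0, 1, 0, 1, 0], "textured": [0, 0, 1, 1, 1, 0]}

X = [extract_features(f) for f in frames]
models = {c: train_classifier(X, y) for c, y in labels.items()}
pairwise = learn_pairwise(labels, list(labels))

def annotate_frame(frame):
    """Run the full pipeline on one frame: features, unary scores, joint labelling."""
    x = extract_features(frame)
    unary = {c: predict_proba(m, x) for c, m in models.items()}
    return annotate(unary, pairwise)

print(annotate_frame([0.9, 0.9, 0.9, 0.9]))
```

The joint step is where the collective model differs from thresholding each classifier independently: a concept whose own score falls slightly below 0.5 can still be selected when a strongly detected, frequently co-occurring concept supports it. Exhaustive enumeration is only practical for a handful of concepts; a larger concept set would need an approximate inference method, which this sketch does not attempt.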