发明名称 Information retrieval with non-negative matrix factorization
摘要 Disclosed is a method of indexing a database of documents, comprising providing a vocabulary of n terms, indexing the database in the form of a non-negative nxm index matrix V, wherein each of its m columns represents an jth document having n entries containing a function of the number of occurrences of a ith term of said vocabulary appearing in said jth document, factoring out non-negative matrix factors T and D such that V≈TD, and wherein T is an nxr term matrix, D is an rxm document matrix, and r<nm/(n+m). The index so generated is useful in two-pass information retrieval systems.
申请公布号 US2003018604(A1) 申请公布日期 2003.01.23
申请号 US20010862524 申请日期 2001.05.22
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 FRANZ MARTIN;MCCARLEY JEFFREY S.
分类号 G06F17/30;(IPC1-7):G06F7/00 主分类号 G06F17/30
代理机构 代理人
主权项
地址