发明名称 METHOD AND SYSTEM OF FILTERING AND RECOMMENDING DOCUMENTS
摘要 Disclosed is a method and system for discovering documents using a computer and providing a small set of the most relevant documents to the attention of a human observer. Using the method, the computer obtains a seed document from the user and generates a seed document vector using term frequency-inverse corpus frequency weighting. A keyword index for a plurality of source documents can be compared with the weighted terms of the seed document vector. The comparison is then filtered to reduce the number of documents, which define an initial subset of the source documents. Initial subset vectors are generated and compared to the seed document vector to obtain a similarity value for each comparison. Based on the similarity value, the method then recommends one or more of the source documents.
申请公布号 US2013339373(A1) 申请公布日期 2013.12.19
申请号 US201313920803 申请日期 2013.06.18
申请人 UT-BATTELLE LLC 发明人 PATTON ROBERT M.;POTOK THOMAS E.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址