发明名称 Document analysis and retrieval
摘要 A computer system configured to implement a method for document analysis and retrieval. A document that includes text is received from a host. Document keys (i.e., keywords and keyphrases) associated with the text are generated. In first embodiments, a provided document taxonomy has categories and associated category keys (i.e., keywords and keyphrases). The category keys of each category are compared with the document keys to determine a distance between the document and each category as a measure of how close the document is to each category. A subset of the categories is returned to the host, wherein the subset of the categories reflects the determined distances. In second embodiments, a search string is created as a logical function of a subset of the document keys. The search string is submitted to a search engine. Links to related documents are received from the search engine and returned to the host.
申请公布号 US8015171(B2) 申请公布日期 2011.09.06
申请号 US20080172507 申请日期 2008.07.14
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 GOSBY DESIREE D. G.;ITO KEITH I
分类号 G06F17/00;G06F17/30 主分类号 G06F17/00
代理机构 代理人
主权项
地址