发明名称 APPARATUS AND METHOD FOR RETRIEVING STRUCTURED DOCUMENTS
摘要 An apparatus for retrieving structured documents includes a first categorizing unit configured to categorize components into a first component of typical descriptions and a second component of atypical descriptions, based on statistics information for the components, a second categorizing unit configured to categorize the terms into a first term whose appearance ratio in the first component exceeds a threshold and a second term whose appearance ratio in the first component is not more than the threshold, an extraction unit configured to extract a set of structured documents each having the first component including the first term and the second component from the structured documents, and a ranking unit configured to rank the set of structured documents by a retrieval score calculating based o a relation between the second term and the second component.
申请公布号 US2009138473(A1) 申请公布日期 2009.05.28
申请号 US20080205636 申请日期 2008.09.05
申请人 KABUSHIKI KAISHA TOSHIBA 发明人 MANABE TOSHIHIKO;KOKUBU TOMOHARU
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址