发明名称 STRUCTURED DOCUMENT PROCESSOR, STRUCTURED DOCUMENT PROCESSING METHOD AND STRUCTURED DOCUMENT PROCESSING PROGRAM
摘要 PROBLEM TO BE SOLVED: To provide a structured document processor, a structured document processing method, and a structured document processing program capable of retrieving a document suitable to a purpose while suppressing the retrieval processing cost. SOLUTION: The structured document processor comprises: a structured document holding means 120 for holding a plurality of structured documents; a character string frequency holding means 122 for associating a character string included in the structured document held by the structured document holding means 120 with character string frequency being appearance frequency of the character string in the plurality of structured documents, and holding them; and an index information holding means 126 for holding character string identification information which identifies the character string to be specified by the index key in association with the index key, wherein, to a frequent character string whose character string frequency is not less than a preset threshold out of character strings held by the character string frequency holding means 122, the frequent character string and structure identification information indicating a position where the frequent character string appears in the structured document are set as the traction keys. COPYRIGHT: (C)2007,JPO&INPIT
申请公布号 JP2007226453(A) 申请公布日期 2007.09.06
申请号 JP20060045808 申请日期 2006.02.22
申请人 TOSHIBA CORP 发明人 MURAI AKIKO
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址