Title of invention DOCUMENT SEARCH DEVICE, METHOD, AND PROGRAM
Abstract PROBLEM TO BE SOLVED: To improve search speed and search accuracy when searching electronic documents that include a table in the text. SOLUTION: In a search for electronic documents based on the vector space method, when a search condition such as "farmland to be sold in Yosino-cho, Yosino-gun, Nara prefecture" is input, a document such as a selection chart, which includes words such as Yosino-gun, Yosino-cho, and farmland but includes no information about farmland located in Yosino-cho, Yosino-gun, Nara prefecture, may be erroneously determined to be a suitable document. In this search method, each cell inside a table in the document is treated as one document, and a plurality of document vectors equal in number to the cells are generated for the document shown in the figure. The distance between each document vector and a search vector generated for the search condition is computed, and the document corresponding to the document vectors is determined to be a suitable document when at least one document vector has a distance to the search vector below a threshold value. This is effective when the values inside the cells are comparatively long. COPYRIGHT: (C)2005,JPO&NCIPI
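The per-cell matching described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the term weighting, the distance measure, or the threshold, so the raw term-frequency vectors, the cosine distance, the threshold of 0.5, the function names, and the sample cell texts below are all assumptions chosen for illustration only.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Build a bag-of-words term-frequency vector (assumed weighting)."""
    return Counter(re.findall(r"[a-z0-9-]+", text.lower()))


def cosine_distance(a, b):
    """Return 1 - cosine similarity; 1.0 when either vector is empty."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def is_suitable(cells, query, threshold):
    """Treat each table cell as one document: the containing document
    is suitable if at least one cell vector lies within the threshold
    distance of the search vector."""
    search_vector = term_vector(query)
    return any(
        cosine_distance(term_vector(cell), search_vector) < threshold
        for cell in cells
    )


# Hypothetical sample data: one document with a long descriptive cell,
# and one selection chart whose cells hold only isolated words.
query = "farmland to be sold in Yosino-cho, Yosino-gun, Nara prefecture"
listing_cells = [
    "farmland for sale in Yosino-cho, Yosino-gun, Nara prefecture",
    "price on request",
]
chart_cells = ["Yosino-gun", "Yosino-cho", "farmland", "residential land"]

print(is_suitable(listing_cells, query, threshold=0.5))  # True
print(is_suitable(chart_cells, query, threshold=0.5))    # False
```

In this sketch the selection chart is rejected because no single cell is close to the search vector, even though the chart as a whole contains several of the query words. The long descriptive cell is accepted, which is consistent with the abstract's remark that the approach works when cell values are comparatively long.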
Publication number JP2005227813(A) Publication date 2005.08.25
Application number JP20040032879 Filing date 2004.02.10
Applicant JUST SYST CORP Inventor TANIOKA HIROKI
Classification G06F17/30;(IPC1-7):G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address