发明名称 IMPROVED SIMILAR DOCUMENT DETECTING METHOD, DEVICE, AND COMPUTER-READABLE RECORDING MEDIUM
摘要 <P>PROBLEM TO BE SOLVED: To provide an improved similar document detection method. <P>SOLUTION: A method which is executed by a similar document detection device includes the steps of: extracting entities of which an importance contributing element to be calculated, from plural web documents; calculating weight values for the respective entities based on the calculated importance contributing elements; and determining whether the plural web documents are similar to each other on the basis of the calculated weight values. In plural documents which are possibly similar documents, a core portion and non-core portion are discriminated from each other in each document, and a different weight value is given to each portion to determine similar documents by an improved manner; and thus, the accuracy of a search engine is improved. <P>COPYRIGHT: (C)2013,JPO&INPIT
申请公布号 JP2012234522(A) 申请公布日期 2012.11.29
申请号 JP20120063358 申请日期 2012.03.21
申请人 NHN CORP 发明人 LEE CHAE HYUN;SHIM SEON-IL
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址