发明名称 |
DOCUMENT SIMILARITY DETERMINING METHOD, UNIT AND PROGRAM |
摘要 |
<P>PROBLEM TO BE SOLVED: To provide a technology to detect a degree of similarity between documents in which text and non-text information are intermingled. <P>SOLUTION: In a method which can be executed on a computer for detecting a degree of similarity between two documents which include a text object, a non-text object or a composite thereof, the method comprises a step to convert each document data into a digraph and to store the same; and a step to calculate a degree of similarity between the converted digraphs by an arithmetic processing of the computer taking into consideration of an importance of the object. <P>COPYRIGHT: (C)2012,JPO&INPIT |
申请公布号 |
JP2011233023(A) |
申请公布日期 |
2011.11.17 |
申请号 |
JP20100104088 |
申请日期 |
2010.04.28 |
申请人 |
INTERNATIONAL BUSINESS MASCHINES CORPORATION |
发明人 |
MISHINA TAKUYA;YOSHIHAMA SACHIKO |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|