发明名称 Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance
摘要 A computer-readable medium comprises data structure for providing information about levels of similarity between pairs of N documents. The data structure comprises a plurality of entries of similarity values representing levels of similarity for a plurality of pairs of the documents. Each of the similarity values represents a level of similarity of one document of a given pair relative to the other document of the given pair. The similarity value of each entry is greater than a threshold similarity value that is greater than zero. The plurality of similarity-value entries are fewer than N<SUP>2</SUP>-N in number if the similarity values are asymmetric with regard to document pairing, and the plurality of similarity-value entries are fewer than <maths id="MATH-US-00001" num="1"> <MATH OVERFLOW="SCROLL"> <MFRAC> <MROW> <MSUP> <MI>N</MI> <MN>2</MN> </MSUP> <MO>-</MO> <MI>N</MI> </MROW> <MN>2</MN> </MFRAC> </MATH> </MATHS> in number if the similarity values are symmetric with regard to document pairing. A method and apparatus for generating the data structure are described.
申请公布号 US2007136336(A1) 申请公布日期 2007.06.14
申请号 US20050298500 申请日期 2005.12.12
申请人 CLAIRVOYANCE CORPORATION 发明人 SHANAHAN JAMES G.;ROMA NORBERT;EVANS DAVID A.
分类号 G06F7/00 主分类号 G06F7/00
代理机构 代理人
主权项
地址