发明名称 Alignment free methodology for rapid determination of differences between a test data set and known data sets
摘要 A method for generating data characterizing an item described by an ordered string of characters, comprises the steps of: (i) for a set of separation metrics each representing a unique number of positions of separation between arbitrary characters in a character group in the ordered string of characters, associating first with each separation metric; generating a set of character groups, wherein each character group comprises at least two characters contained within the ordered string of characters; and (ii) for at least one given character group in the set of character groups, for each given separation metric in the set of separation metrics, generating second data representing number of occurrences that the given character group satisfies the given separation metric; generating third data associated with the given character group, wherein the third data is based upon the second data and the first data; and storing the third data in memory for subsequent use.
申请公布号 US6434488(B1) 申请公布日期 2002.08.13
申请号 US19990454379 申请日期 1999.12.03
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 ROBSON BARRY
分类号 G06F17/18;G06F19/00;(IPC1-7):G01N33/48;G06F17/14 主分类号 G06F17/18
代理机构 代理人
主权项
地址