Title of invention |
String pattern conceptualization from detection of related concepts by analyzing substrings with common prefixes and suffixes |
Abstract |
A conceptualization method uses maximum or other substrings of a string pattern to find specific N-tuples of substring triples, with N ≥ 2 and index m = 1, ..., N, inside a reference set (SET_r_i) of strings (STR_n_i). Each N-tuple is considered a candidate for representing related concepts. The concatenation of each substring triple is an explicit member of the reference set (SET_r_i). Within the substring triples found inside the reference set (SET_r_i), no middle substring is equal to another middle substring, each prefix substring (X_i) is equal to all other prefix substrings (X_i), and each suffix substring (Z_i) is equal to all other suffix substrings (Z_i). Either the prefix substring (X_i) or the suffix substring (Z_i) is not empty. |
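The conditions in the abstract can be illustrated with a minimal brute-force sketch in Python. It is not the patented procedure: it enumerates every split of every reference string rather than starting from maximum substrings of a given string pattern, and the function name `candidate_tuples`, the parameter `min_n`, and the requirement that the middle substring be non-empty are assumptions made only for this illustration.

```python
from collections import defaultdict


def candidate_tuples(reference_set, min_n=2):
    """Group reference strings by shared (prefix, suffix) pairs.

    Hypothetical helper: returns a dict mapping (prefix, suffix) to the
    sorted list of distinct middle substrings, keeping only groups with
    at least `min_n` middles (N >= 2 in the abstract).
    """
    groups = defaultdict(set)
    for s in reference_set:
        # Try every split s = prefix + middle + suffix.
        for i in range(len(s) + 1):
            # Assumption of this sketch: the middle substring is non-empty.
            for j in range(i + 1, len(s) + 1):
                prefix, middle, suffix = s[:i], s[i:j], s[j:]
                # The prefix or the suffix must be non-empty.
                if prefix or suffix:
                    # Using a set keeps the middles pairwise distinct.
                    groups[(prefix, suffix)].add(middle)
    # Every stored triple concatenates to a member of reference_set by
    # construction, so only the tuple size remains to be checked.
    return {
        key: sorted(middles)
        for key, middles in groups.items()
        if len(middles) >= min_n
    }
```

For a reference set such as `{"big cat", "big dog"}`, the key `("big ", "")` groups the middles `cat` and `dog`, which is the kind of N-tuple the abstract treats as a candidate for related concepts. The sketch also returns shorter, less informative splits of the same strings; ranking or filtering those is outside what the abstract specifies.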
Publication number |
US8311795(B2) |
Publication date |
2012.11.13 |
Application number |
US20080347070 |
Filing date |
2008.12.31 |
Applicant |
ARNING ANDREAS;SEIFFERT ROLAND;INTERNATIONAL BUSINESS MACHINES CORPORATION |
Inventor |
ARNING ANDREAS;SEIFFERT ROLAND |
Classification |
G06F17/20;G06F7/00;G06F17/27 |
Main classification |
G06F17/20 |
Agency |

Agent |

Principal claim |

Address |
