发明名称 |
AUTOMATIC, COMPUTER-BASED SIMILARITY CALCULATION SYSTEM FOR QUANTIFYING THE SIMILARITY OF TEXT EXPRESSIONS |
摘要 |
The invention relates to a device and a method for the automatic, computer-based weighting of the similarity of text expressions. The inventive system or method comprises a document database unit (1), a candidate expression storage unit (2), and a similarity weight value calculation unit (3) while being characterized in that the similarity weight values agw(t<SUB>1</SUB>, t<SUB>2</SUB>) for the individual pairs of expressions can be calculated based on a degree of similarity occ_con(t<SUB>1</SUB>, t<SUB>2</SUB>) that takes into account both the total frequency with which the two expressions of a pair of expressions are used within one and the same text segment in a number of several text segments and the total number of different context expressions in said number of text segments. |
申请公布号 |
WO2007048607(A2) |
申请公布日期 |
2007.05.03 |
申请号 |
WO2006EP10332 |
申请日期 |
2006.10.26 |
申请人 |
FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.;CHEN, LIBO;THIEL, ULRICH;FANKHAUSER, PETER;KAMPS, THOMAS |
发明人 |
CHEN, LIBO;THIEL, ULRICH;FANKHAUSER, PETER;KAMPS, THOMAS |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|