发明名称 SYSTEM AND METHOD FOR UTILIZING MULTIPLE ENCODINGS TO IDENTIFY SIMILAR LANGUAGE CHARACTERS
摘要 Described herein are systems and methods for identifying the similarity between language characters. As described herein, a pair of language characters is received at a language character match engine. The language character match engine is adapted to receive encoding configuration information from each of a plurality of encoding components, and is adapted to encode the pair of language characters based on the unique structure of each language character to generate a pair of string identification characters for each encoding component. Thereafter, each pair of string identification characters is compared to one another to generate a similarity score, and the similarity score for each pair of string identification characters is combined to create a composite similarity score. The composite similarity score represents a similarity between the pair of language characters, and is used to identify the similarity between the pair of language characters.
申请公布号 US2014052436(A1) 申请公布日期 2014.02.20
申请号 US201213566385 申请日期 2012.08.03
申请人 QIAN JUN;OUAGUENOUNI SOFIANE;ORACLE INTERNATIONAL CORPORATION 发明人 QIAN JUN;OUAGUENOUNI SOFIANE
分类号 G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址