发明名称 System and method for keyword spotting using multiple character encoding schemes
摘要 Methods and systems for finding search phrases in a body of data that is encoded using any of multiple possible character encoding schemes. An analytics system accepts an input search phrase for searching in a certain body of data. The system identifies two or more candidate character encoding schemes, which may have been used for encoding the body of data. Having determined the candidate encoding schemes, the system translates the input search phrase into multiple encoding-specific search phrases that represent the input search phrase in the respective candidate encoding schemes. The system then searches the body of data for occurrences of the input search phrase using the multiple encoding-specific search phrases.
申请公布号 US8990238(B2) 申请公布日期 2015.03.24
申请号 US201213457373 申请日期 2012.04.26
申请人 Verint Systems Ltd. 发明人 Goldfarb Eithan
分类号 G06F17/30;G06F7/00;G06F17/22;G06F17/27 主分类号 G06F17/30
代理机构 Meunier Carlin & Curfman 代理人 Meunier Carlin & Curfman
主权项 1. A method, comprising: accepting an input search phrase to be located in a body of data; identifying multiple candidate character encoding schemes using one or more characteristics of the input search phrase; translating the input search phrase into multiple encoding-specific search phrases, each encoding-specific search phrase representing the input search phrase in a different, respective candidate character encoding scheme; and identifying one or more occurrences of the input search phrase in the body of data by searching the body of data using each of the multiple encoding-specific search phrases.
地址 Herzelia, Pituach IL
您可能感兴趣的专利