发明名称 Fast, efficient hardware mechanism for natural language determination
摘要 A language in which a document is written is identified by comparing the words of a document to the most frequently used words in a plurality of candidate languages. The words are stored in a plurality of sets of word tables, each set of word tables for storing a selected set of most frequently used words in a respective candidate language according to letter pairs in the words. In the preferred embodiment, each of the word tables is an NxN bit table, where each bit represents a given letter pair at a particular place in one of the most frequently used words in a respective candidate language. A set of table access registers, is used for accessing a respective set of word tables to compare words from the document to words stored in the word tables; each table access register accesses word tables for a respective candidate language. One or more word counting registers count a number of matches for a respective candidate language. A comparator selects a candidate language which corresponds to the word counting register having the highest count as the language in which the document is written.
申请公布号 US6002998(A) 申请公布日期 1999.12.14
申请号 US19960723818 申请日期 1996.09.30
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 MARTINO, MICHAEL JOHN;PAULSEN, JR., ROBERT CHARLES
分类号 G06F17/27;(IPC1-7):G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址