发明名称 METHOD FOR MINIMIZING DATABASE SIZE OF N-GRAM LANGUAGE MODEL
摘要 The present invention relates to a method and a device for minimizing a database size of an n-gram language model. The method for minimizing the database size of the n-gram language model according to the present invention includes the steps of: generating one or more text data by analyzing an input signal; extracting a candidate text data in which a word corresponding to a dictionary database exists, from the one or more text data; calculating a hash value with respect to the candidate text data by using a hash function; and obtaining appearance frequency data corresponding to the hash value from an n-gram table corresponding to a word count of the candidate text data. The present invention has an effect that an index value is removed from the n-gram table by obtaining the appearance frequency data of the n-gram table through the hash value of the text data, thus a size of the n-gram table is reduced.
申请公布号 KR20160053587(A) 申请公布日期 2016.05.13
申请号 KR20140152841 申请日期 2014.11.05
申请人 DIOTEK CO., LTD. 发明人 MIN, CHUNG GI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址