Automatic language identification using both N-gram and word information,申请号US19980219615-传众专利搜索

首页产品黄页商标征信

会员服务注册登录

法人/股东/高管

发明名称	Automatic language identification using both N-gram and word information
摘要	The predominant language of a sample text is automatically identified using probability data that include N-gram probability data for at least one language and word probability data for at least one language. The N-gram probability data of a language indicate, for each N-gram, the probability that it occurs if the language is predominant. Similarly, the word probability data of a language indicate, for each word, the probability that it occurs if the language is predominant. The probability data are used to automatically obtain sample probability data for at least two languages. The sample probability data include N-gram probability information for at least one language and word probability information for at least one language. The sample probability data are used to automatically obtain language identifying data identifying the language whose sample probability data indicate the highest probability. The N-grams can be trigrams, while the words can be short words of no more than five characters. Some languages can have both trigram and word probabilities, while some can have only trigram probabilities.
申请公布号	US6167369(A)	申请公布日期	2000.12.26
申请号	US19980219615	申请日期	1998.12.23
申请人	XEROX COMPANY	发明人	SCHULZE, BRUNO M.
分类号	G06F17/27;G06F17/28;(IPC1-7):G06F17/27	主分类号	G06F17/27
代理机构		代理人
主权项
地址

您可能感兴趣的专利

溅射靶及其制造方法

精密仪器用的润滑脂组合物以及使用该组合物的表

Transition metal complex, process for producing the same, olefin polymerization catalyst containing the transition metal complex and process for producing olefin polymers

CONNECTION METHOD AND CONNECTION STRUCTURE OF PLASTIC PIPE

METHOD FOR QUANTITATING CREATININE DISCHARGED IN URINE, METHOD FOR MEASURING URINE COMPONENT AND APPARATUS THEREFOR

Optical element holding device for exposure apparatus

Molded article located in the beam path of radar device

Method for operating a lifting apparatus and/or a conveyor and lifting apparatus and/or conveyor

Cover for duvet or pillow comprising two interchangeable panels, and set of bed linen comprising such cover

Package of toothbrushes

Navigation unit in hinge

Portable terminal device and communication control method

Wire ring net for rocky wall barriers and method for making it

AD MARKET SYSTEM AND METHOD

Gas feed pipe connecting screw for continuous casting nozzle

Controller for driving a permanent magnet type synchronous motor

Dry analytical element for high-density lipoprotein cholesterol quantification

Aqueous traffic paint and method of application

Combined tape player and optical media reader

Process for producing granular anionic surfactant