Word boundary probability estimating, probabilistic language model building, kana-kanji converting, and unknown word model building,申请号US20050180153-传众专利搜索

首页产品黄页商标征信

会员服务注册登录

法人/股东/高管

发明名称	Word boundary probability estimating, probabilistic language model building, kana-kanji converting, and unknown word model building
摘要	Calculates a word n-gram probability with high accuracy in a situation where a first corpus), which is a relatively small corpus containing manually segmented word information, and a second corpus, which is a relatively large corpus, are given as a training corpus that is storage containing vast quantities of sample sentences. Vocabulary including contextual information is expanded from words occurring in first corpus of relatively small size to words occurring in second corpus of relatively large size by using a word n-gram probability estimated from an unknown word model and the raw corpus. The first corpus (word-segmented) is used for calculating n-grams and the probability that the word boundary between two adjacent characters will be the boundary of two words (segmentation probability). The second corpus (word-unsegmented), in which probabilistic word boundaries are assigned based on information in the first corpus (word-segmented), is used for calculating a word n-grams.
申请公布号	US2006015326(A1)	申请公布日期	2006.01.19
申请号	US20050180153	申请日期	2005.07.13
申请人	INTERNATIONAL BUSINESS MACHINES CORPORATION	发明人	MORI SHINSUKE;TAKUMA DAISUKE
分类号	G06F17/27	主分类号	G06F17/27
代理机构		代理人
主权项
地址

您可能感兴趣的专利

Treating surfaces to enhance bio-compatibility.

Selective dopamine D3 receptor agonists for the treatment of sexual dysfunction.

Methods for reducing immunogenicity of polypeptides.

PYRAZOLO[4,3-D]PYRIMIDINONE COMPOUNDS AS CGMP PDE INHIBITORS

INDOLE DERIVATIVES AS BETA-2 AGONISTS.

PROCESS FOR EPOXIDATION AND CATALYST TO BE USED THEREIN.

PLATFORM SYSTEM AND METHOD FOR EXTENDING SALES AND USE OF A RESOURCE OF MOTIVATIONAL PROGRAMS.

Procedure for the optimized transfer of ATM cells over connection links

METHODS AND SYSTEMS FOR REDUCING INTERFERENCE USING CO-CHANNEL INTERFERENCE MAPPING

METHOD FOR ADMINISTERING THERMOTHERAPY TO PREVENT THE GROWTH OF TUMORS.

METHOD OF PRODUCING POLYAMIDE NANOCOMPOSITES AND INJECTION MOLDED PARTS PRODUCIBLE THEREFROM.

A method and apparatus for securely accessing data or functionality of a device

CONTACT RING FOR ON-CELL BATTERY TESTER

METHOD AND DEVICE FOR MEASURING INTERNAL INFORMATION OF SCATTERING ABSORBER

Transparent polyester film with improved barrier to water vapour, method of preparation and usage

METHOD FOR MAKING FLOAT GLASS HAVING REDUCED DEFECT DENSITY.

BALLISTIC RESISANT AND FIRE RESISTANT COMPOSITE ARTICLES.

CELL-BASED FLUORESCENCE RESONANCE ENERGY TRANSFER (FRET) ASSAYS FOR CLOSTRIDIAL TOXINS.

POLYMER DISPERSIONS WITH LOW VISCOSITY AND METHOD FOR PRODUCTION THEREOF.

METHODS AND SYSTEMS FOR REMOTELY ACCESSING A DIGITAL TELEVISION TERMINAL VIA A GLOBAL COMMUNICATION NETWORK.