Title of invention: Character string dividing or separating method and related system for segmenting agglutinative text or document into words
Abstract: A joint probability of two neighboring characters appearing in a given Japanese document database is statistically calculated. The calculated joint probabilities are stored in a table. An objective Japanese sentence is segmented into a plurality of words with reference to the calculated joint probabilities so that each division point of the objective Japanese sentence is present between two neighboring characters having a smaller joint probability.
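The procedure outlined in the abstract can be illustrated with a minimal sketch. This is not the claimed implementation: the function names `build_joint_probability_table` and `segment` are hypothetical, the joint probability is assumed to be the relative frequency of each adjacent character pair over all adjacent pairs in the corpus, and the fixed `threshold` used to decide what counts as a "smaller" joint probability is an assumption introduced only for illustration.

```python
from collections import Counter


def build_joint_probability_table(documents):
    """Estimate the joint probability of each adjacent character pair.

    Assumption: probability = pair count / total number of adjacent pairs
    observed in the document collection.
    """
    pair_counts = Counter()
    for text in documents:
        # zip(text, text[1:]) yields every pair of neighboring characters.
        for left, right in zip(text, text[1:]):
            pair_counts[(left, right)] += 1
    total = sum(pair_counts.values())
    if total == 0:
        return {}
    return {pair: count / total for pair, count in pair_counts.items()}


def segment(sentence, table, threshold):
    """Split a sentence wherever the neighboring pair's joint probability
    falls below the (assumed) threshold. Unseen pairs count as 0.0."""
    if not sentence:
        return []
    words = []
    current = sentence[0]
    for left, right in zip(sentence, sentence[1:]):
        if table.get((left, right), 0.0) < threshold:
            # Low joint probability: place a division point here.
            words.append(current)
            current = right
        else:
            # High joint probability: keep the characters in one word.
            current += right
    words.append(current)
    return words


if __name__ == "__main__":
    corpus = ["東京都", "東京駅", "京都市", "京都駅"]
    table = build_joint_probability_table(corpus)
    print(segment("東京駅京都", table, threshold=0.2))
```

A table keyed by character pairs keeps lookup at segmentation time to one dictionary access per candidate division point. How the cut-off is chosen in practice is not stated in this record.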
Publication number: US2001009009(A1) Publication date: 2001.07.19
Application number: US20000745795 Filing date: 2000.12.26
Applicant: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. Inventor: IIZUKA YASUKI
Classification: G06F17/21;G06F17/27;G06F17/28;G06F17/30;(IPC1-7):G06F15/00 Main classification: G06F17/21
Agency: Agent:
Principal claim:
Address: