Title of invention |
Character string dividing or separating method and related system for segmenting agglutinative text or document into words |
Abstract |
A joint probability of two neighboring characters appearing in a given Japanese document database is statistically calculated. The calculated joint probabilities are stored in a table. An objective Japanese sentence is segmented into a plurality of words with reference to the calculated joint probabilities so that each division point of the objective Japanese sentence is present between two neighboring characters having a smaller joint probability.
|
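The procedure in the abstract can be illustrated with a minimal sketch. The abstract states only that division points fall between neighboring characters with a smaller joint probability, so the sketch makes three assumptions: the joint probability is estimated as the relative frequency of each adjacent character pair, a fixed cutoff `threshold` decides what counts as "smaller", and unseen pairs get probability 0. The function names `build_joint_table` and `segment` are hypothetical and do not come from the patent.

```python
from collections import Counter


def build_joint_table(corpus):
    """Estimate the joint probability of each adjacent character pair.

    Assumption: probability = pair count / total number of adjacent pairs.
    """
    counts = Counter()
    for doc in corpus:
        counts.update(zip(doc, doc[1:]))  # all neighboring character pairs
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


def segment(sentence, table, threshold):
    """Split the sentence wherever the neighboring pair's probability is low.

    Assumption: a fixed threshold stands in for "smaller joint probability";
    pairs absent from the table are treated as probability 0.0.
    """
    if not sentence:
        return []
    words = []
    start = 0
    for i in range(1, len(sentence)):
        p = table.get((sentence[i - 1], sentence[i]), 0.0)
        if p < threshold:
            words.append(sentence[start:i])
            start = i
    words.append(sentence[start:])
    return words


# Toy corpus in Latin letters; a real table would be built from a
# Japanese document database.
table = build_joint_table(["abab", "cdcd", "abcd"])
print(segment("abcd", table, 0.2))  # ['ab', 'cd']
```

In the toy corpus the pairs "ab" and "cd" each have probability 1/3 while "bc" has 1/9, so the only division point falls between "b" and "c". The threshold value 0.2 is chosen for the example only.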
Publication number |
US2001009009(A1) |
Publication date |
2001.07.19 |
Application number |
US20000745795 |
Filing date |
2000.12.26 |
Applicant |
MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. |
Inventor |
IIZUKA YASUKI |
Classification number |
G06F17/21;G06F17/27;G06F17/28;G06F17/30;(IPC1-7):G06F15/00 |
Main classification number |
G06F17/21 |
Patent agency |
|
Agent |
|
Main claim |
|
Address |
|