Title of invention: Automatic segmentation of continuous text using statistical approaches
Abstract: An automatic segmenter divides continuous text into segments rapidly, consistently, and with semantic accuracy. It uses two statistical segmentation methods. The first, "forward-backward matching", is simple and fast but can produce occasional errors in long phrases. The second, the "statistical stack search segmenter", uses statistical language models to produce more accurate segmentation output, at the expense of two times more execution time than "forward-backward matching". Where speed is the major concern, "forward-backward matching" can be used; where highly accurate output is desired, the "statistical stack search segmenter" is the better choice.
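The abstract names "forward-backward matching" without describing its steps. The sketch below shows one common reading of that idea: a greedy longest-match scan from the left, the same scan from the right, and a rule for choosing between the two results when they differ. The function names, the toy vocabulary, and the tie-break rule (fewer words wins, backward result on a tie) are assumptions made for illustration, not details taken from the patent.

```python
def forward_match(text, vocab, max_len):
    """Greedy longest-match scan from left to right."""
    words, i = [], 0
    while i < len(text):
        # Try the longest candidate first; a single character always matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                words.append(piece)
                i += size
                break
    return words


def backward_match(text, vocab, max_len):
    """Greedy longest-match scan from right to left."""
    words, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in vocab:
                words.append(piece)
                j -= size
                break
    words.reverse()  # words were collected from the end of the text
    return words


def forward_backward_match(text, vocab):
    """Run both scans and pick one result.

    Assumption: when the scans disagree, prefer the result with fewer
    words, and prefer the backward result on a tie. The patent may use
    a different (for example likelihood-based) rule.
    """
    max_len = max((len(w) for w in vocab), default=1)
    fwd = forward_match(text, vocab, max_len)
    bwd = backward_match(text, vocab, max_len)
    if fwd == bwd:
        return fwd
    return fwd if len(fwd) < len(bwd) else bwd


# Hypothetical toy vocabulary, chosen so the two scans disagree.
vocab = {"研究", "研究生", "生命", "起源"}
print(forward_match("研究生命起源", vocab, 3))     # ['研究生', '命', '起源']
print(forward_backward_match("研究生命起源", vocab))  # ['研究', '生命', '起源']
```

In this toy case the forward scan commits to the longer word too early and strands a single character, which is the kind of occasional error a purely greedy method can make.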
Publication number: US5806021(A)  Publication date: 1998.09.08
Application number: US19960700823  Filing date: 1996.09.04
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION  Inventors: CHEN, CHENGJUN JULIAN; LIU, FU-HUA; PICHENY, MICHAEL ALAN
Classification: G06F17/27; (IPC1-7): G06F17/27; G06F17/20  Main classification: G06F17/27
Agency:  Agent:
Main claim:
Address: