Title of invention: Method for converting formatted documents to ordered word lists
Abstract: A computer-implemented method converts a formatted document or text to an ordered list of words. The formatted document is first partitioned into a first and a second data structure stored in the memory of a computer. The first data structure stores the text fragments of the formatted document, and the second data structure stores its code fragments. Adjacent text fragments are concatenated to form possible ordered word lists, and the possible words are matched against a dictionary of representative words. The best ordered word list, the one having the fewest words, is selected from the possible ordered word lists.
Publication number: EP0878766(A2) Publication date: 1998.11.18
Application number: EP19980303195 Filing date: 1998.04.24
Applicant: DIGITAL EQUIPMENT CORPORATION Inventors: EUSTACE, ROBERT A.; DION, JEREMY
Classification: G06F17/30;G06F17/22;G06F17/27;(IPC1-7):G06F17/22 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address:
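
The steps summarized in the abstract can be illustrated with a small sketch. This is not the patented implementation, only a brute-force illustration under stated assumptions: formatting codes are taken to be HTML-style tags, the "possible ordered word lists" are generated by either joining or separating each pair of adjacent text fragments, and ties are broken by preferring lists with fewer words missing from the dictionary. The names `partition`, `candidate_word_lists`, `best_word_list`, and `SAMPLE_DICTIONARY` are hypothetical and do not come from the record.

```python
import re
from itertools import product

# Assumption: code fragments look like HTML-style tags.
TAG = re.compile(r"<[^>]*>")

# Hypothetical stand-in for the "dictionary of representative words".
SAMPLE_DICTIONARY = {"hello", "world", "of", "data", "base", "database"}


def partition(doc):
    """Split doc into two structures: text fragments and code fragments."""
    text_fragments, code_fragments = [], []
    pos = 0
    for match in TAG.finditer(doc):
        if match.start() > pos:
            text_fragments.append(doc[pos:match.start()])
        code_fragments.append(match.group())
        pos = match.end()
    if pos < len(doc):
        text_fragments.append(doc[pos:])
    return text_fragments, code_fragments


def candidate_word_lists(text_fragments):
    """Yield every word list obtained by joining or separating neighbors."""
    if not text_fragments:
        return
    gaps = len(text_fragments) - 1
    # "" concatenates two adjacent fragments, " " keeps them apart.
    for separators in product(("", " "), repeat=gaps):
        joined = text_fragments[0]
        for sep, fragment in zip(separators, text_fragments[1:]):
            joined += sep + fragment
        yield joined.split()


def best_word_list(doc, dictionary):
    """Pick the candidate with the fewest unknown words, then fewest words."""
    text_fragments, _ = partition(doc)

    def score(words):
        unknown = sum(word.lower() not in dictionary for word in words)
        return (unknown, len(words))

    return min(candidate_word_lists(text_fragments), key=score, default=[])
```

For the input `"<p>Hello <b>W</b>orld of data<i>base</i></p>"`, the text fragments are `"Hello "`, `"W"`, `"orld of data"`, and `"base"`, and `best_word_list` returns `["Hello", "World", "of", "database"]`: joining `"W"` with `"orld"` avoids two unknown words, and joining `"data"` with `"base"` gives four words instead of five. The enumeration grows exponentially with the number of fragments, so a practical version would need a more economical search.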