发明名称 Text structure analysis method and text structure analysis device
摘要 A text structure analysis method and apparatus in which the apparatus includes a content boundary pattern storage device for storing content boundary patterns indicating boundaries of various contents represented as collections of given contents of text and a text analysis device for detecting boundary sections present in the input text based on the contents stored in the content boundary pattern storage device. The text analysis device establishes content boundaries for those detected boundary sections. When extracted from the input text as contents for each collection of contents of that text, the content boundary patterns indicating the boundaries of the various contents are detected, content boundaries are established for that input text, and the text is treated in units of content for each collection of content based on the established content boundaries.
申请公布号 US6263336(B1) 申请公布日期 2001.07.17
申请号 US19980016226 申请日期 1998.01.30
申请人 SEIKO EPSON CORPORATION 发明人 TANAKA TOSHIO
分类号 G06F17/22;G06F17/27;(IPC1-7):G06F17/30 主分类号 G06F17/22
代理机构 代理人
主权项
地址
您可能感兴趣的专利