发明名称 Vision-based document segmentation.
摘要 Vision-based document segmentation identifies one or more portions of semantic content of a document. The one or more portions are identified by identifying a plurality of visual blocks in the document, and detecting one or more separators between the visual blocks of the plurality of visual blocks. A content structure for the document is constructed based at least in part on the plurality of visual blocks and the one or more separators, and the content structure identifies the one or more portions of semantic content of the document. The content structure obtained using the vision-based document segmentation can optionally be used during document retrieval.
申请公布号 ZA200405370(B) 申请公布日期 2005.03.15
申请号 ZA20040005370 申请日期 2004.07.06
申请人 MICROSOFT CORPORATION 发明人 JI-RONG WEN;SHIPENG YU;DENG CAI;WEI-YING MA
分类号 G06F15/00;G06F;G06F17/00;G06F17/21;G06F17/22;G06F17/30;G06K9/72;G06K15/00 主分类号 G06F15/00
代理机构 代理人
主权项
地址