Title of invention METHODS AND SYSTEM FOR DOCUMENT RECONSTRUCTION
Abstract Some embodiments provide a method for analyzing an unstructured document that includes a number of glyphs, each of which has a position in the unstructured document. Based on positions of the glyphs in the unstructured document, the method creates associations between different sets of glyphs in order to identify different sets of glyphs as different words. The method creates associations between different sets of words in order to identify different sets of words as different paragraphs. The method defines associations between paragraphs that are not contiguous in order to define a reading order for the paragraphs.
Publication number KR101324799(B1) Publication date 2013.11.01
Application number KR20117018126 Filing date 2009.12.31
Applicant Inventor
Classification number G06F9/44;G06F17/21;G06F17/27 Main classification number G06F9/44
Agency Agent
Principal claim
Address
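
The three stages named in the abstract (glyphs to words, words to paragraphs, paragraphs to reading order) can be illustrated with a minimal Python sketch. It is not the claimed method: the record gives no algorithmic detail, so every name and threshold here (`Glyph`, `Word`, `word_gap`, `column_gap`, `line_gap`, `indent_tol`, `column_width`) is a hypothetical choice, and the sketch assumes axis-aligned, left-to-right text with y increasing downward and no first-line indents.

```python
from dataclasses import dataclass


@dataclass
class Glyph:
    char: str
    x: float      # left edge
    y: float      # baseline, increasing downward (assumption)
    width: float


@dataclass
class Word:
    text: str
    x: float      # left edge of first glyph
    y: float      # baseline of first glyph
    right: float  # right edge of last glyph


def _make_word(gs):
    last = gs[-1]
    return Word("".join(g.char for g in gs), gs[0].x, gs[0].y, last.x + last.width)


def glyphs_to_rows(glyphs, word_gap=2.0, line_tol=1.0):
    """Group glyphs into rows by baseline, then into words by horizontal gap."""
    raw_rows = []
    for g in sorted(glyphs, key=lambda g: g.y):
        if raw_rows and abs(raw_rows[-1][0].y - g.y) <= line_tol:
            raw_rows[-1].append(g)
        else:
            raw_rows.append([g])
    rows = []
    for row in raw_rows:
        row.sort(key=lambda g: g.x)
        words, current = [], [row[0]]
        for prev, g in zip(row, row[1:]):
            if g.x - (prev.x + prev.width) > word_gap:  # gap too wide: new word
                words.append(_make_word(current))
                current = []
            current.append(g)
        words.append(_make_word(current))
        rows.append(words)
    return rows


def rows_to_paragraphs(rows, column_gap=30.0, line_gap=15.0, indent_tol=5.0):
    """Split rows at column gaps, then attach each segment to a paragraph
    whose last line has the same left edge and lies just above it."""
    paragraphs = []  # each paragraph is a list of lines; each line a list of Word
    for row in rows:  # rows arrive top to bottom
        segments = [[row[0]]]
        for prev, w in zip(row, row[1:]):
            if w.x - prev.right > column_gap:
                segments.append([])
            segments[-1].append(w)
        for seg in segments:
            for p in paragraphs:
                last = p[-1][0]
                if abs(last.x - seg[0].x) <= indent_tol and 0 < seg[0].y - last.y <= line_gap:
                    p.append(seg)
                    break
            else:
                paragraphs.append([seg])
    return paragraphs


def reading_order(paragraphs, column_width=100.0):
    """Order paragraphs column by column, top to bottom within a column."""
    return sorted(paragraphs, key=lambda p: (p[0][0].x // column_width, p[0][0].y))


def paragraph_text(paragraph):
    return " ".join(w.text for line in paragraph for w in line)


def layout(text, x, y, char_w=5.0):
    """Test helper: place one glyph per non-space character on a baseline."""
    return [Glyph(c, x + i * char_w, y, char_w) for i, c in enumerate(text) if c != " "]
```

With two columns, the paragraph lower in the left column is read before the right column even though the two are not contiguous on the page, which is the kind of association the abstract describes:

```python
glyphs = (layout("ab cd", 0, 0) + layout("ij kl", 200, 0)
          + layout("ef", 0, 10) + layout("mn", 200, 10)
          + layout("gh", 0, 40))
ordered = reading_order(rows_to_paragraphs(glyphs_to_rows(glyphs)))
print([paragraph_text(p) for p in ordered])
```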