发明名称 Identification of Layout and Content Flow of an Unstructured Document
摘要 Some embodiments provide a method for analyzing an unstructured document that includes a number of glyphs, each of which has a position in the unstructured document. Based on positions of the glyphs in the unstructured document, the method creates associations between different sets of glyphs in order to identify different sets of glyphs as different words. The method creates associations between different sets of words in order to identify different sets of words as different paragraphs. The method defines associations between paragraphs that are not contiguous in order to define a reading order for the paragraphs.
申请公布号 US2015324338(A1) 申请公布日期 2015.11.12
申请号 US201514710525 申请日期 2015.05.12
申请人 Apple Inc. 发明人 Levy Michael Robert;Mansfield Philip Andrew
分类号 G06F17/22;G06F17/28 主分类号 G06F17/22
代理机构 代理人
主权项
地址 Cupertino CA US