Title of invention: PAGE LAYOUT DETERMINATION OF AN IMAGE UNDERGOING OPTICAL CHARACTER RECOGNITION
Abstract: A method and system are provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed, and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component sorts the text regions in accordance with their reading order.
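The pipeline described in the abstract (words with bounding boxes, then text regions, then reading lines, then regions sorted by reading order) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the `Word` type, the function names, the gap thresholds, the proximity-based grouping, and the simple top-to-bottom, left-to-right ordering are all hypothetical stand-ins for whatever the claimed components actually do.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Word:
    """A recognized word with its bounding box (hypothetical type)."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


def _near(a: Word, b: Word, max_dx: float, max_dy: float) -> bool:
    # Gap between the two boxes along each axis (0 if they overlap on it).
    dx = max(a.x0 - b.x1, b.x0 - a.x1, 0.0)
    dy = max(a.y0 - b.y1, b.y0 - a.y1, 0.0)
    return dx <= max_dx and dy <= max_dy


def group_into_regions(words: List[Word], max_dx: float = 20.0,
                       max_dy: float = 10.0) -> List[List[Word]]:
    """Assumed heuristic: words closer than the thresholds share a region."""
    parent = list(range(len(words)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(words)):
        for j in range(i + 1, len(words)):
            if _near(words[i], words[j], max_dx, max_dy):
                parent[find(i)] = find(j)

    regions: Dict[int, List[Word]] = {}
    for i, word in enumerate(words):
        regions.setdefault(find(i), []).append(word)
    return list(regions.values())


def group_into_lines(region: List[Word]) -> List[List[Word]]:
    """Assumed heuristic: a word joins a line if its vertical center
    falls inside the vertical span of that line's first word."""
    lines: List[List[Word]] = []
    for word in sorted(region, key=lambda w: (w.y0, w.x0)):
        center = (word.y0 + word.y1) / 2
        for line in lines:
            if line[0].y0 <= center <= line[0].y1:
                line.append(word)
                break
        else:
            lines.append([word])
    return [sorted(line, key=lambda w: w.x0) for line in lines]


def sort_regions(regions: List[List[Word]]) -> List[List[Word]]:
    """Assumed reading order: by each region's top edge, then left edge."""
    return sorted(regions, key=lambda r: (min(w.y0 for w in r),
                                          min(w.x0 for w in r)))


def layout(words: List[Word]) -> List[List[List[Word]]]:
    """Regions in reading order, each split into reading lines."""
    return [group_into_lines(r) for r in sort_regions(group_into_regions(words))]


if __name__ == "__main__":
    sample = [
        Word("note", 345, 0, 385, 10),
        Word("again", 0, 14, 40, 24),
        Word("world", 45, 0, 90, 10),
        Word("Side", 300, 0, 340, 10),
        Word("Hello", 0, 0, 40, 10),
    ]
    for region in layout(sample):
        print([[w.text for w in line] for line in region])
```

A real multi-column page would likely need a more careful ordering rule than sorting by top-left corner; the sketch only shows how the four stages named in the abstract could fit together.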
Publication number: US2011222771(A1)  Publication date: 2011.09.15
Application number: US20100721949  Filing date: 2010.03.11
Applicant: MICROSOFT CORPORATION  Inventors: CIMPOI MIRCEA; GALIC SASA; VUGDELIJA MILAN
Classification: G06K9/34  Main classification: G06K9/34
Agency:  Agent:
Main claim:
Address: