发明名称 Converting digital images containing text to token-based files for rendering
摘要 A computer-implemented method is provided for converting a scanned-in electronic image into a token-based file. The method includes generally five steps. First, various tokens (i.e., graphical units) are identified in the electronic image. Second, the identified tokens having similar shapes are classified together to form a token group, to thereby form multiple token groups, each including one or more tokens having similar shapes. Third, in each token group, a representative token is found, which morphologically represents the shapes of tokens included in the group. Fourth, each representative token is converted into a vectorized token, which is a mathematical representation of the shape of the representative token. Fifth, each of the vectorized tokens is associated with the positions of the tokens in the electronic image represented by the vectorized token. Thus, upon rendering, the vectorized token is displayed to thereby create a page image consisting only of clean images of vectorized tokens.
申请公布号 US7460710(B2) 申请公布日期 2008.12.02
申请号 US20060392213 申请日期 2006.03.29
申请人 AMAZON TECHNOLOGIES, INC. 发明人 COATH ADAM BRIAN;AKALIN FREDERICK ZIYA RAMOS;GOODWIN ROBERT L.;SHAGAM JOSHUA
分类号 G06K9/34 主分类号 G06K9/34
代理机构 代理人
主权项
地址