Title of invention: System, plug-in, and method for improving text composition by modifying character prominence according to assigned character information measures
Abstract: A computer-implemented system and method for composing a formatted text input to improve legibility, readability and/or print economy while preserving the format of the text input and satisfying any user-selected aesthetic constraints. An information measure (IM) is assigned to each character in a language unit. Multiple different IMs are assigned to each character and combined to form a combined IM (CIM) for each character indicating the predictability of that character to differentiate the language unit from other language units. The process is repeated for at least a plurality of language units and typically until all the text input has been analyzed and information measures assigned to all of the characters.
Publication number: US8755629(B2) Publication date: 2014.06.17
Application number: US201213632018 Filing date: 2012.09.30
Applicant: Language Technologies, Inc. Inventors: Bever Thomas G.; Nicholas Christopher D.; Hancock Roeland; Alcock Keith W.; Jandreau Steven M.
Classification: G06K9/00 Main classification: G06K9/00
Agency: Agent: Gifford Eric A.
Main claim: 1. A computer-implemented method embodied in a non-transitory medium, said method configured to execute the computer-implemented steps of: a) reading in successive blocks of formatted text input having defined characters including letters and punctuation including spaces; b) examining a language unit in the text input of the current block, said language unit including a lexical unit, a sub-lexical unit or a subset of only punctuation including spaces; c) computing a plurality of information measures (IMs) selected from lexical, extra-lexical, sub-lexical and sub-character informativeness IMs for each character in the language unit; d) combining the plurality of IMs into a single combined information measure (CIM) for each said character indicating the predictability of that character to differentiate the language unit from other language units; e) repeating steps b through d for a plurality of language units in the current block; and f) outputting a list of CIMs for each character in the plurality of language units in the current block.
Address: Tucson AZ US
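
Illustrative sketch (not part of the patent record): steps b) through f) of the main claim can be pictured roughly as the loop below. This is a minimal sketch under stated assumptions, not the patented implementation. The record does not say how any IM is computed or how IMs are combined, so the toy lexicon `LEXICON`, the two measures `unigram_surprisal_im` and `neighbor_contrast_im`, and the weighted-sum `combine` are all hypothetical stand-ins. Only lexical units (runs of letters) are examined here; punctuation and spaces, which the claim also covers, are skipped.

```python
import math
import re
from collections import Counter

# Hypothetical toy lexicon standing in for real corpus statistics.
LEXICON = ["cat", "cot", "cut", "car", "bat", "the"]


def unigram_surprisal_im(unit, index, lexicon=LEXICON):
    """Hypothetical sub-lexical IM: surprisal (in bits) of the character,
    estimated from letter frequencies in the lexicon."""
    counts = Counter("".join(lexicon))
    total = sum(counts.values())
    ch = unit[index].lower()
    # Add-one smoothing so letters absent from the lexicon stay finite.
    p = (counts[ch] + 1) / (total + len(counts) + 1)
    return -math.log2(p)


def neighbor_contrast_im(unit, index, lexicon=LEXICON):
    """Hypothetical lexical IM: share of same-length lexicon words that
    differ from the unit at this character position."""
    word = unit.lower()
    peers = [w for w in lexicon if len(w) == len(word) and w != word]
    if not peers:
        return 0.0
    differing = sum(1 for w in peers if w[index] != word[index])
    return differing / len(peers)


def combine(ims, weights):
    """Assumed combination rule: a weighted sum of the individual IMs."""
    return sum(w * v for w, v in zip(weights, ims))


def cims_for_block(block, weights=(0.5, 0.5)):
    """Return (unit, index, character, CIM) for each letter in the block."""
    results = []
    for unit in re.findall(r"[A-Za-z]+", block):       # step b: language units
        for i, ch in enumerate(unit):
            ims = (unigram_surprisal_im(unit, i),       # step c: several IMs
                   neighbor_contrast_im(unit, i))
            results.append((unit, i, ch, combine(ims, weights)))  # step d
    return results                                      # step f: list of CIMs


if __name__ == "__main__":
    for unit, i, ch, cim in cims_for_block("The cat"):
        print(f"{unit}[{i}]={ch!r} CIM={cim:.3f}")
```

Step a) (reading successive blocks) would correspond to calling `cims_for_block` once per block of input. A downstream composition stage could then use the per-character CIM values to adjust character prominence, as the title describes; that stage is not sketched here.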