Title of Invention |
System, plug-in, and method for improving text composition by modifying character prominence according to assigned character information measures |
Abstract |
A computer implemented system and method for composing a formatted text input to improve legibility, readability and/or print economy while preserving the format of the text input and satisfying any user selected aesthetic constraints. An information measure (IM) is assigned to each character in a language unit. Multiple different IMs are assigned to each character and combined to form a combined IM (CIM) for each character indicating the predictability of that character to differentiate the language unit from other language units. The process is repeated for at least a plurality of language units and typically until all the text input has been analyzed and information measures assigned to all of the characters. |
Publication Number |
US8755629(B2) |
Publication Date |
2014.06.17 |
Application Number |
US201213632018 |
Filing Date |
2012.09.30 |
Applicant |
Language Technologies, Inc. |
Inventors |
Bever Thomas G.;Nicholas Christopher D.;Hancock Roeland;Alcock Keith W.;Jandreau Steven M. |
Classification Number |
G06K9/00 |
Main Classification Number |
G06K9/00 |
Agency |
|
Agent |
Gifford Eric A. |
Main Claim |
1. A computer-implemented method embodied in a non-transitory medium, said method configured to execute the computer-implemented steps of:
a) reading in successive blocks of formatted text input having defined characters including letters and punctuation including spaces;
b) examining a language unit in the text input of the current block, said language unit including a lexical unit, a sub-lexical unit or a subset of only punctuation including spaces;
c) computing a plurality of information measures (IMs) selected from lexical, extra-lexical, sub-lexical and sub-character informativeness IMs for each character in the language unit;
d) combining the plurality of IMs into a single combined information measure (CIM) for each said character indicating the predictability of that character to differentiate the language unit from other language units;
e) repeating steps b through d for a plurality of language units in the current block; and
f) outputting a list of CIMs for each character in the plurality of language units in the current block.
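Steps b) through f) of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not say how any IM is computed or how IMs are combined. The toy `LEXICON`, the two measures (`lexical_im`, `sublexical_im`), the weighted-sum rule in `combine`, and the name `cims_for_block` are all hypothetical choices made for illustration. The sketch also treats only alphabetic words as language units, whereas the claim additionally covers sub-lexical units and runs of punctuation and spaces.

```python
from collections import Counter
import math
import re

# Hypothetical toy lexicon; a real system would draw on a large corpus.
LEXICON = ["the", "then", "than", "that", "this", "cat", "car", "can"]

def lexical_im(word, lexicon):
    """Assumed lexical IM: share of same-length lexicon words that differ at each position."""
    peers = [w for w in lexicon if len(w) == len(word) and w != word]
    if not peers:
        return [0.0] * len(word)
    return [sum(p[i] != ch for p in peers) / len(peers) for i, ch in enumerate(word)]

def sublexical_im(word, lexicon):
    """Assumed sub-lexical IM: surprisal in bits of each character given the previous one."""
    bigrams, contexts, alphabet = Counter(), Counter(), set(word)
    for w in lexicon:
        alphabet.update(w)
        padded = "^" + w  # "^" marks the start of a word
        for a, b in zip(padded, padded[1:]):
            bigrams[a, b] += 1
            contexts[a] += 1
    v = len(alphabet)
    padded = "^" + word
    # Add-one smoothing keeps unseen bigrams at a finite surprisal.
    return [-math.log2((bigrams[a, b] + 1) / (contexts[a] + v))
            for a, b in zip(padded, padded[1:])]

def combine(ims, weights):
    """Assumed combination rule (step d): weighted sum of the IMs for each character."""
    return [sum(w * im[i] for w, im in zip(weights, ims)) for i in range(len(ims[0]))]

def cims_for_block(block, lexicon=LEXICON, weights=(0.5, 0.5)):
    """Steps b) to f): return a list of (unit, [(char, CIM), ...]) for one text block."""
    result = []
    for unit in re.findall(r"[A-Za-z]+", block):  # step b: pick out language units
        word = unit.lower()
        ims = [lexical_im(word, lexicon), sublexical_im(word, lexicon)]  # step c
        cim = combine(ims, weights)  # step d
        result.append((unit, list(zip(unit, cim))))
    return result  # step f

if __name__ == "__main__":
    for unit, scores in cims_for_block("the cat"):
        print(unit, [(ch, round(s, 3)) for ch, s in scores])
```

With this toy lexicon, the final "t" of "cat" receives the highest CIM of the word, because it is the only position that separates "cat" from "car" and "can". The two measures are on different scales (a fraction versus bits), so a fuller sketch would normalize them before combining.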
|
地址 |
Tucson AZ US |