摘要 |
Text sections and each comic character within each of at least one scanned comic frame are identified. Text is captured from each of the identified text sections using optical character recognition (OCR) of each of the identified text sections. A sequence of the text sections is determined based upon grammatical conventions of a language within which the at least one scanned comic frame is presented. An audio output model is identified for each of the determined sequence of the text sections. The at least one scanned comic frame is stored with the captured text, the determined sequence of the text sections, and the identified audio output model for each of the determined sequence of the text sections. This abstract is not to be considered limiting, since other embodiments may deviate from the features described in this abstract. |