摘要 |
A method and apparatus for identifying words described in a page description file. A computer device stores a page description language file which includes characters that have not been identified as words by the page description language. A word identifying mechanism reads the page description language file and groups characters to form at least one word from the characters. The system preferably transfers words to a client process capable of processing words at a request of the client process. In a method for identifying words from a page description file, characters are read from the file and are stored in a word buffer until a word break is detected based upon character position data stored in the file. The contents of the word buffer are then provided to a client process as an identified word. The method can also sort the characters from the file into a display order prior to storing the characters in the word buffer. The method and apparatus can be used for searching for words in a page description file.
|