Independent claim |
1. A method, comprising steps of:
obtaining, by a processor, a transcription of a spoken message for categorization by assigning a context category to the transcription, wherein the context category indicates a subject matter context of the transcription; parsing the obtained transcription to identify words in the obtained transcription; accessing, in a data storage, a corpus of categorized transcriptions, wherein the corpus of categorized transcriptions is a database containing a plurality of data records, and each data record includes a keyword field and a category field related to a categorized transcription; retrieving a first data record of a context category from the accessed corpus of categorized transcriptions; comparing the identified words from the obtained transcription to keywords of the retrieved first data record; in response to a comparison result indicating that a sufficient match to the obtained transcription is not found in the first data record, retrieving a second data record of the context category from the accessed corpus of categorized transcriptions; comparing the identified words from the obtained transcription to keywords of the second data record; and in response to the comparison yielding a match of the identified words with the second data record to a sufficient degree, storing data related to the obtained transcription in a statistics processing file in the data storage. |
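
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the claimed implementation. The names `Record`, `parse_words`, `match_score`, and `categorize` are hypothetical, the keyword-overlap ratio and the 0.5 threshold are assumed stand-ins for the "sufficient match" that the claim leaves undefined, and an in-memory list stands in for the statistics processing file.

```python
import re
from dataclasses import dataclass


@dataclass
class Record:
    """One data record of the corpus (hypothetical layout)."""
    keywords: frozenset  # keyword field
    category: str        # category field


def parse_words(transcription):
    """Parse the transcription into a set of lowercase words."""
    return set(re.findall(r"[a-z']+", transcription.lower()))


def match_score(words, record):
    """Assumed metric: fraction of the record's keywords found in the words."""
    if not record.keywords:
        return 0.0
    return len(words & record.keywords) / len(record.keywords)


def categorize(transcription, corpus, category, stats, threshold=0.5):
    """Try records of the given category one at a time.

    On the first sufficient match, append data about the transcription
    to `stats` (a stand-in for the statistics processing file) and
    return the matching record. Return None if no record matches.
    """
    words = parse_words(transcription)
    for record in corpus:
        if record.category != category:
            continue  # only records of the requested context category
        if match_score(words, record) >= threshold:
            stats.append({"category": category, "transcription": transcription})
            return record
    return None


corpus = [
    Record(frozenset({"invoice", "payment", "refund"}), "billing"),
    Record(frozenset({"refund", "charge"}), "billing"),
]
stats = []
matched = categorize(
    "I was charged twice, please refund the charge", corpus, "billing", stats
)
```

In this example the first record shares only one of its three keywords with the transcription, so it falls below the assumed threshold and the second record is retrieved and compared, mirroring the first-record and second-record sequence of the claim.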