发明名称 AUTO-TAGGER THAT LEARNS
摘要 Examples perform context categorization by categorizing a transcription of a speech file based on the context of the subject matter of the transcription. A computer processor is configuration to provide a system that generates a normalized transcription of the speech file transcription and compares elements of the normalized transcription to elements of a context categorization model or a corpus of categorized transcriptions to determine whether the normalized transcription contains keywords of transcriptions that have previously been categorized. If the comparison yields a result indicating a specific match with a context category, the normalized transcription is assigned to the matching context category. As the number of successfully categorized transcriptions stored in the corpus increases, the more frequently the system and method examples perform successful comparisons. As a result, the context categorization accuracy increases and the system appears to learn.
申请公布号 US2015154956(A1) 申请公布日期 2015.06.04
申请号 US201314095483 申请日期 2013.12.03
申请人 Cellco Partnership d/b/a Verizon Wireless 发明人 Brown Deborah Washington
分类号 G10L15/26 主分类号 G10L15/26
代理机构 代理人
主权项 1. A method, comprising steps of: obtaining, by a processor, a transcription of a spoken message for categorization by assigning a context category to the transcription, wherein the context category indicates a subject matter context of the transcription; parsing the obtained transcription to identify words in the obtained transcription; accessing, in a data storage, a corpus of categorized transcriptions, wherein the corpus of categorized transcriptions is a database containing a plurality of data records, and each data record includes a keyword field and a category field related to a categorized transcription; retrieving a first data record of a context category from the accessed corpus of categorized transcriptions; comparing the identified words from the obtained transcription to keywords of the retrieved first data record; in response to a comparison result indicating a sufficient match to the obtained transcription is not found in the first data record, retrieving a second data record of the context category from the assessed corpus of categorized transcriptions; comparing the identified words from the obtained transcription to keywords of the second data record; and in response to the comparison yielding a match of the identified words with the second data record to sufficient degree, storing data related to the obtained transcription in a statistics processing file in the data storage.
地址 Basking Ridge NJ US
您可能感兴趣的专利