发明名称 Method and System of Indexing Speech Data
摘要 A method and system of indexing speech data. The method includes indexing word transcripts including a timestamp for a word occurrence; and indexing sub-word transcripts including a timestamp for a sub-word occurrence. A timestamp in the index indicates the time and duration of occurrence of the word or sub-word in the speech data, and word and sub-word occurrences can be correlated using the timestamps. A method of searching speech transcripts is also provided in which a search query in the form of a phrase to be searched includes at least one in-vocabulary word and at least one out-of-vocabulary word. The method of searching includes extracting the search terms from the phrase, retrieving a list of occurrence of words for an in-vocabulary search term from an index of words having timestamps, retrieving a list of occurrences of sub-words for an out-of-vocabulary search term from an index of sub-words having timestamps, and merging the retrieved lists of occurrences of words and sub-words according to their timestamps.
申请公布号 US2009030680(A1) 申请公布日期 2009.01.29
申请号 US20070781285 申请日期 2007.07.23
申请人 发明人 MAMOU JONATHAN JOSEPH
分类号 G10L15/26;G10L11/00;G10L15/00 主分类号 G10L15/26
代理机构 代理人
主权项
地址