Title of invention Methods and apparatus for semantic unit based automatic indexing and searching in data archive systems
Abstract An audio-based data indexing and retrieval system for processing audio-based data associated with a particular language, comprising: (i) memory for storing the audio-based data; (ii) a semantic unit based speech recognition system for generating a textual representation of the audio-based data, the textual representation being in the form of one or more semantic units corresponding to the audio-based data; (iii) an indexing and storage module, operatively coupled to the semantic unit based speech recognition system and the memory, for indexing the one or more semantic units and storing the one or more indexed semantic units; and (iv) a search engine, operatively coupled to the indexing and storage module and the memory, for searching the one or more indexed semantic units for a match with one or more semantic units associated with a user query, and for retrieving the stored audio-based data based on the one or more indexed semantic units. The semantic unit is preferably a syllable or morpheme. Further, the invention is particularly well suited for use with Asian and Slavic languages.
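The indexing and search components described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class name `SyllableIndex`, the clip identifiers, and the pinyin syllable transcripts are hypothetical, and the speech recognition step is assumed to have already produced a syllable sequence for each audio clip. The sketch keeps an inverted index from each syllable to the clips that contain it, then confirms that the query syllables occur contiguously.

```python
from collections import defaultdict


class SyllableIndex:
    """Toy inverted index from semantic units (syllables) to audio clip IDs."""

    def __init__(self):
        self.postings = defaultdict(set)  # syllable -> set of clip IDs
        self.transcripts = {}             # clip ID -> list of syllables

    def add(self, clip_id, units):
        """Index one clip given its recognized syllable sequence."""
        units = list(units)
        self.transcripts[clip_id] = units
        for unit in units:
            self.postings[unit].add(clip_id)

    @staticmethod
    def _contains(sequence, query):
        """Return True if query occurs as a contiguous run in sequence."""
        width = len(query)
        return any(
            sequence[i:i + width] == query
            for i in range(len(sequence) - width + 1)
        )

    def search(self, query_units):
        """Return sorted IDs of clips whose transcript contains the query."""
        query = list(query_units)
        if not query:
            return []
        # Candidate clips must contain every query syllable.
        candidates = set.intersection(
            *(self.postings.get(unit, set()) for unit in query)
        )
        # Keep only clips where the syllables appear in order and adjacent.
        return sorted(
            clip for clip in candidates
            if self._contains(self.transcripts[clip], query)
        )


# Hypothetical recognizer output (pinyin syllables) for three clips.
index = SyllableIndex()
index.add("clip1", ["bei", "jing", "da", "xue"])
index.add("clip2", ["da", "xue", "sheng"])
index.add("clip3", ["xue", "da"])

hits = index.search(["da", "xue"])
```

Here `hits` lists the clips that would be retrieved from memory for the query. Indexing at the syllable level means the query does not need to be segmented into words first, which is one reason such units suit languages without explicit word boundaries.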
Publication number US7177795(B1) Publication date 2007.02.13
Application number US19990437971 Filing date 1999.11.10
Applicant INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors CHEN CHENGIUN JULIAN;KANEVSKY DIMITRI
Classification G06F17/27;G06F17/30;G10L15/00;G10L15/08;G10L15/18 Main classification G06F17/27
Agency Agent
Main claim
Address