Title of invention System and method for synchronized text display and audio playback
Abstract An audio processing system and method for providing synchronized display of recognized text from an original audio file and playback of the original audio file. The system includes a speech recognition module, a silence insertion module, and a silence detection module. The speech recognition module generates text and audio pieces. The silence insertion module aggregates the audio pieces into an aggregated audio file. The silence detection module converts the original audio file and the aggregated audio file into silence-detected versions, in which silent and non-silent blocks are identified using a threshold volume. The silence insertion module compares the silence-detected original and aggregated audio files, determines the differences in position of the non-silent elements, and inserts silence within the audio pieces accordingly. The characteristics of the silence-inserted audio pieces are used to synchronize the display of recognized text from the original audio file with playback of the original audio file.
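The comparison step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes audio is a plain list of integer sample amplitudes, that "threshold volume" means an absolute-amplitude cutoff, and that non-silent blocks in the two files pair up one-to-one in order. The function names `detect_silence`, `non_silent_blocks`, and `silence_to_insert` are hypothetical and do not come from the record.

```python
from typing import List, Tuple


def detect_silence(samples: List[int], threshold: int) -> List[bool]:
    """Mark each sample True if non-silent (assumed: |amplitude| >= threshold)."""
    return [abs(s) >= threshold for s in samples]


def non_silent_blocks(mask: List[bool]) -> List[Tuple[int, int]]:
    """Return (start, end) index pairs, end exclusive, for each run of True."""
    blocks = []
    start = None
    for i, is_sound in enumerate(mask):
        if is_sound and start is None:
            start = i                      # a non-silent block begins
        elif not is_sound and start is not None:
            blocks.append((start, i))      # the block ended at i
            start = None
    if start is not None:
        blocks.append((start, len(mask)))  # block runs to the end
    return blocks


def silence_to_insert(original: List[int], aggregated: List[int],
                      threshold: int) -> List[int]:
    """Silent samples to insert before each aggregated block so that it
    starts where the matching original block starts.

    Assumption: blocks correspond one-to-one, in order.
    """
    orig_blocks = non_silent_blocks(detect_silence(original, threshold))
    agg_blocks = non_silent_blocks(detect_silence(aggregated, threshold))
    inserts = []
    shift = 0  # silence already inserted ahead of the current block
    for (o_start, _), (a_start, _) in zip(orig_blocks, agg_blocks):
        gap = max(o_start - (a_start + shift), 0)
        inserts.append(gap)
        shift += gap
    return inserts


original = [0, 0, 5, 6, 0, 0, 0, 7, 8, 0]   # speech with pauses
aggregated = [5, 6, 0, 7, 8]                # recognized pieces, pauses trimmed
print(silence_to_insert(original, aggregated, threshold=3))
```

In this toy input, each of the two aggregated blocks needs two silent samples inserted ahead of it to line up with the original. A real system would more likely work on frames of samples than on individual samples and would need to handle blocks that do not match one-to-one.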
Publication number US2005080633(A1) Publication date 2005.04.14
Application number US20030681428 Filing date 2003.10.08
Applicant MITRA IMAGING INCORPORATED Inventors LUECK MICHAEL F.;LOWE ROBERT A.;VAN DOKKUMBURG STEVEN R.
Classification G10L11/02;G10L13/00;G10L15/00;G10L15/04;G10L15/22;(IPC1-7):G10L11/00 Main classification G10L11/02
Agency Agent
Principal claim
Address