发明名称 Using speech to text for detecting commercials and aligning edited episodes with transcripts
摘要 Methods and apparatus, including computer program products, for using speech to text for detecting commercials and aligning edited episodes with transcripts. A method includes, receiving an original video or audio having a transcript, receiving an edited video or audio of the original video or audio, applying a speech-to-text process to the received original video or audio having a transcript, applying a speech-to-text process to the received edited video or audio, and applying an alignment to determine locations of the edits.
申请公布号 US9020817(B2) 申请公布日期 2015.04.28
申请号 US201313744585 申请日期 2013.01.18
申请人 Ramp Holdings, Inc. 发明人 Johnson R Paul;Lau Raymond
分类号 G10L15/26;G10L15/00;G11B27/00 主分类号 G10L15/26
代理机构 Chapin IP Law, LLC 代理人 Chapin IP Law, LLC
主权项 1. A method comprising: in a computer system having a processor and a memory, receiving an original video or audio having a transcript; receiving an edited video or audio of the original video or audio; applying a speech-to-text process to the received original video or audio having a transcript; applying a speech-to-text process to the received edited video or audio; and applying an alignment to determine locations of the edits.
地址 Woburn MA US