发明名称 METADATA EXTRACTION OF NON-TRANSCRIBED VIDEO AND AUDIO STREAMS
摘要 A system and computer based method for transcribing and extracting metadata from a source media. A processor-based server extracts audio and video stream from the source media. A speech recognition engine processes the audio and/or video stream to transcribe the audio and/or video stream into a time-aligned textual transcription and to extract audio amplitude by time interval, thereby providing a time-aligned machine transcribed media. The server processor measures the aural amplitude of the extracted audio amplitude and assigns a numerical value that is normalized to a single, normalized, universal amplitude scale. A database stores the time-aligned machine transcribed media, time-aligned video frames and the assigned value from the normalized amplitude scale.
申请公布号 US2016163318(A1) 申请公布日期 2016.06.09
申请号 US201614988580 申请日期 2016.01.05
申请人 DATASCRIPTION LLC 发明人 WILDER JONATHAN;DEANGELIS, JR. KENNETH;SCHONFELD MAURICE W.
分类号 G10L15/26;G06K9/34;G06K9/00;G10L15/25;G10L25/57 主分类号 G10L15/26
代理机构 代理人
主权项
地址 BEVERLY HILLS CA US