发明名称 LOCALIZED AUDIO SOURCE EXTRACTION FROM VIDEO RECORDINGS
摘要 Technologies are generally described for a system to process a collection of video recordings of a scene to extract and localize audio sources for the audio data. According to some examples, video recordings captured by mobile devices from different perspectives may be uploaded to a central database. Video segments capturing an overlapping portion of the scene at an overlapping time may be identified, and a relative location of each of the video capturing devices may be determined. Audio data for the video segments may be indexed with a sub-frame time reference and relative locations as a function of overlapping time. Using the indices that include the sub-frame time references and relative locations, audio sources for the audio data may be extracted and localized. The extracted audio sources may be transcribed and indexed to enable searching, and may be added back to each video recording as a separate audio channel.
申请公布号 US2015350716(A1) 申请公布日期 2015.12.03
申请号 US201314380698 申请日期 2013.12.09
申请人 Empire Technology Development LLC 发明人 Kruglick Ezekiel
分类号 H04N21/43;H04N21/44;H04N21/81;H04N21/439;G11B27/10;G11B27/036 主分类号 H04N21/43
代理机构 代理人
主权项 1. A method to extract localized audio sources, the method comprising: identifying two video recordings of a scene captured by two spatially separate video capturing devices; identifying at least two video segments within the recordings capturing an overlapping visual frame of the scene recorded at an overlapping time frame; determining a location of each of the two video capturing devices during the overlapping time frame; indexing audio data recorded by the two video capture devices with a time reference and a location based on the determined locations of the video capturing devices; and localizing one or more audio sources for the audio data recorded by the two video capture devices based on the determined locations of the video capturing devices.
地址 Wilmington DE US