发明名称 System and method for digital video retrieval involving speech recognition
摘要 Disclosed are systems, methods, and computer readable media for retrieving digital images. The method embodiment includes converting a descriptive audio stream of a digital video that is provided for the visually impaired to text and then aligning that text to the appropriate segment of the digital video. The system then indexes the converted text from the descriptive audio stream with the text's relationship to the digital video. The system enables queries using action words describing a desired scene from a digital video.
申请公布号 US9135336(B2) 申请公布日期 2015.09.15
申请号 US201313943220 申请日期 2013.07.16
申请人 AT&T Intellectual Property I, L.P. 发明人 Bangalore Srinivas
分类号 H04L29/06;G06F21/00;G06F17/30;G10L15/26;H04N21/439;H04N21/4402;H04N21/482;H04N21/8547 主分类号 H04L29/06
代理机构 代理人
主权项 1. A method comprising: converting a descriptive audio stream associated with a digital video to text; performing a non-textual optical analysis of a frame on the digital video, to yield an analysis; and aligning the text to frames in the digital video based on the analysis, a first bit rate associated with the digital video, and a second bit rate associated with the descriptive audio stream.
地址 Atlanta GA US