发明名称 CAPTIONING USING SOCIALLY DERIVED ACOUSTIC PROFILES
摘要 <p>Mechanisms for performing dynamic automatic speech recognition on a portion of multimedia content are provided. Multimedia content is segmented into homogeneous segments of content with regard to speakers and background sounds. For the at least one segment, a speaker providing speech in an audio track of the at least one segment is identified using information retrieved from a social network service source. A speech profile for the speaker is generated using information retrieved from the social network service source, an acoustic profile for the segment is generated based on the generated speech profile, and an automatic speech recognition engine is dynamically configured for operation on the at least one segment based on the acoustic profile. Automatic speech recognition operations are performed on the audio track of the at least one segment to generate a textual representation of speech content in the audio track corresponding to the speaker.</p>
申请公布号 WO2014049461(A1) 申请公布日期 2014.04.03
申请号 WO2013IB58011 申请日期 2013.08.27
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION;IBM UNITED KINGDOM LIMITED;IBM JAPAN LIMITED 发明人 YAN, SHUNGUO;WOODWARD, ELIZABETH, VERA
分类号 G10L15/10;G10L15/00;G10L15/22;G10L17/00 主分类号 G10L15/10
代理机构 代理人
主权项
地址