FEATURE OPTIMIZATION AND RELIABILITY ESTIMATION FOR AUDIO AND VIDEO SIGNATURE GENERATION AND DETECTION
摘要
Features are extracted from video and audio content that have a known temporal relationship with one another. The extracted features are used to generate video and audio signatures, which are assembled with an indication of the temporal relationship into a synchronization signature construct. the construct may be used to calculate synchronization errors between video and audio content received at a remote destination. Measures of confidence are generated at the remote destination to optimize processing and to provide an indication of reliability of the calculated synchronization error.