Title: Query and matching for content recognition
Abstract: Various embodiments enable audio data, such as music data, to be captured, by a device, from a background environment and processed to formulate a query that can then be transmitted to a content recognition service. In one or more embodiments, multiple queries are transmitted to the content recognition service. In at least some embodiments, subsequent queries can progressively incorporate previous queries plus additional data that is captured. In one or more embodiments, responsive to receiving the query, the content recognition service can employ a multi-stage matching technique to identify content items responding to the query. This matching technique can be employed as queries are progressively received.
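The progressive querying described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names `progressive_query` and `fake_service`, the representation of captured data as lists of floats, and the stopping rule of the stand-in service are all assumptions made for the example.

```python
from typing import Callable, List, Optional, Sequence


def progressive_query(
    chunks: Sequence[List[float]],
    recognize: Callable[[List[float]], Optional[str]],
) -> Optional[str]:
    """Send cumulative queries until the service reports a match.

    `chunks` are successive portions of captured data (hypothetical
    representation); `recognize` stands in for the content recognition
    service and returns None when nothing is identified.
    """
    accumulated: List[float] = []
    for chunk in chunks:
        # Each query carries the earlier data plus the newly captured data.
        accumulated.extend(chunk)
        result = recognize(list(accumulated))
        if result is not None:
            # Stop submitting queries once content information arrives.
            return result
    return None


def fake_service(features: List[float]) -> Optional[str]:
    """Hypothetical stand-in: 'recognizes' audio after 6 feature values."""
    return "Song A" if len(features) >= 6 else None
```

Calling `progressive_query([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]], fake_service)` would submit queries of growing size and stop at the first one the stand-in service can answer.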
Publication number: US8880545(B2)  Publication date: 2014.11.04
Application number: US201113110185  Filing date: 2011.05.18
Applicant: Microsoft Corporation  Inventors: Koishida Kazuhito; Nister David; Simon Ian; Butcher Tom
Classification: G06F17/30  Main classification: G06F17/30
Agency:  Agents: Spellman Steven; Ross Jim; Minhas Micky
Principal claim: 1. One or more computer-readable storage media comprising instructions that are executable to cause a device to perform operations comprising: capturing, using a computing device, audio data, at least some of which is processable for provision to a content recognition service; formulating, by applying a Hamming window to the audio data and further processing the audio data at the computing device, a query for submission to the content recognition service to identify displayable content information associated with the audio data; submitting a first query to a content recognition service, the first query being formulated using one or more features extracted from a first portion of the audio data, each of the one or more features comprising at least spectral peak data for use in identifying the displayable content information associated with the audio data; responsive to an indication that no displayable content information is received based on the first query, submitting one or more subsequent queries to the content recognition service, the one or more subsequent queries comprising at least one of the one or more features extracted from the first portion of the audio data and used to formulate the first query, along with additional features not included in the first query; and terminating said submitting the one or more subsequent queries responsive to receiving the displayable content information from the content recognition service.
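The feature-extraction step named in the claim (applying a Hamming window and deriving spectral peak data) might look roughly like the sketch below. The claim does not specify the frame length, the transform, or the peak-picking rule, so the plain DFT, the local-maximum test, and the names `hamming_window`, `magnitude_spectrum`, and `spectral_peaks` are assumptions chosen for illustration only.

```python
import cmath
import math
from typing import List


def hamming_window(n: int) -> List[float]:
    """Symmetric Hamming window: 0.54 - 0.46 * cos(2*pi*k / (n - 1))."""
    if n == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * k / (n - 1)) for k in range(n)]


def magnitude_spectrum(frame: List[float]) -> List[float]:
    """Window one frame, then take DFT magnitudes for bins 0..n/2.

    A direct O(n^2) DFT keeps the sketch dependency-free; a real system
    would presumably use an FFT.
    """
    n = len(frame)
    windowed = [s * w for s, w in zip(frame, hamming_window(n))]
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * b * t / n)
                for t, x in enumerate(windowed)))
        for b in range(n // 2 + 1)
    ]


def spectral_peaks(mags: List[float], count: int) -> List[int]:
    """Return the bin indices of the `count` strongest local maxima."""
    local = [
        b for b in range(1, len(mags) - 1)
        if mags[b] > mags[b - 1] and mags[b] >= mags[b + 1]
    ]
    strongest = sorted(local, key=lambda b: mags[b], reverse=True)[:count]
    return sorted(strongest)


# Usage: a 64-sample frame holding a pure tone centred on bin 8.
frame = [math.sin(2 * math.pi * 8 * t / 64) for t in range(64)]
peaks = spectral_peaks(magnitude_spectrum(frame), 1)
```

Peak bin indices such as `peaks` could then serve as the per-frame features from which a first query, and later cumulative queries, are formulated.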
Address: Redmond WA US