发明名称 Method For Finding Elements In A Webpage Suitable For Use In A Voice User Interface (Disambiguation)
摘要 A disambiguation process for a voice interface for web pages or other documents. The process identifies interactive elements such as links, obtains one or more phrases of each interactive element, such as link text, title text and alternative text for images, and adds the phrases to a grammar which is used for speech recognition. A group of interactive elements are identified as potential best matches to a voice command when there is no single, clear best match. The disambiguation process modifies a display of the document to provide unique labels for each interactive element in the group, and the user is prompted to provide a subsequent spoke command to identify one of the unique labels. The selected unique label is identified and a click event is generated for the corresponding interactive element.
申请公布号 US2014350941(A1) 申请公布日期 2014.11.27
申请号 US201313899074 申请日期 2013.05.21
申请人 Microsoft Corporation 发明人 Zeigler Andrew Stephen;Kim Michael H.;Benson Rodger;Sarin Raman;Ju Yun-Cheng
分类号 G10L21/10 主分类号 G10L21/10
代理机构 代理人
主权项 1. A method for providing a voice user interface, comprising: analyzing a document to identify a plurality of interactive elements in the document, each interactive element of the plurality of interactive elements comprises an associated phrase; rendering the document to provide a display on a display device, the associated phrases are provided in the display; comparing a voice command of a user to a plurality of phrases, the plurality of phrases comprise the associated phrases of the plurality of interactive elements; based on the comparing, determining a matching score for each interactive element indicating a degree of matching of its associated phrase to the voice command; identifying one of the interactive elements as a closest match to the voice command based on its matching score; and based on the matching scores, deciding whether to generate a click event for the one of the interactive elements which is the closest match or to initiate a disambiguation process which allows the user to select from among a group of the interactive elements which comprise matching scores which are highest among the plurality of interactive elements.
地址 Redmond WA US