发明名称 Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
摘要 Methods, systems, and computer-readable media related to a technique for combining two or more aspects of predictive information for auto-completion of user input, in particular, user commands directed to an intelligent digital assistant. Specifically, predictive information based on (1) usage frequency, (2) usage recency, and (3) semantic information encapsulated in an ontology (e.g., a network of domains) implemented by the digital assistant, are integrated in a balanced and sensible way within a unified framework, such that a consistent ranking of all completion candidates across all domains may be achieved. Auto-completions are selected and presented based on the unified ranking of all completion candidates.
申请公布号 US9582608(B2) 申请公布日期 2017.02.28
申请号 US201414298720 申请日期 2014.06.06
申请人 Apple Inc. 发明人 Bellegarda Jerome R.
分类号 G06F17/21;G06F17/30;G06F17/27 主分类号 G06F17/21
代理机构 Morrison & Foerster LLP 代理人 Morrison & Foerster LLP
主权项 1. A method of providing cross-domain semantic ranking of complete input phrases for a digital assistant, comprising: receiving a training corpus comprising a collection of complete input phrases that span a plurality of semantically distinct domains; for each of a plurality of distinct words present in the collection of complete input phrases, calculating a respective word indexing power across the plurality of domains based on a respective normalized entropy for said word, wherein the respective normalized entropy is based on a total number of domains in which said word appears and how representative said word is for each of the plurality of domains; for each complete input phrase in the collection of complete input phrases, calculating a respective phrase indexing power across the plurality of domains based on an aggregation of the respective word indexing powers of all constituent words of said complete input phrase; obtaining respective domain-specific usage frequencies of the complete input phrases in the training corpus; and generating a cross-domain ranking of the collection of complete input phrases based at least on the respective phrase indexing powers of the complete input phrases and the respective domain-specific usage frequencies of the complete input phrases.
地址 Cupertino CA US