发明名称 System and Method for Determining Concepts in a Content Item Using Context
摘要 The present invention is directed towards systems and methods for indexing one or more items of content. The method of the present invention comprises extracting one or more items of text from a given item of content. The one or more items of extracted text are tokenized into one or more concepts. One or more related concepts associated with the one or more concepts are identified. A support score is generated for the one or more concepts, and the item of content is index with the one or more concepts and the one or more associated support scores.
申请公布号 US2014365499(A1) 申请公布日期 2014.12.11
申请号 US201414467339 申请日期 2014.08.25
申请人 Yahoo! Inc. 发明人 PARIKH JIGNASHU;Thrall John
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method implemented on at least one machine having at least one processor, storage, and a communication platform connected to a network for processing one or more items of content, the method comprising: receiving a search query; identifying, based on an index of items of content, a set of items of content responsive to the search query, wherein the index comprises individual items of content with corresponding one or more concepts and corresponding support scores for each of the one or more concepts, wherein the support score for each concept is determined based on whether that concept appears in the corresponding item of content; obtaining, for each individual item of content in the set, a sum of the support scores associated with the one or more concepts that are related to the search query; and providing the set, wherein the items of content in the set are sorted based on the sum of the support scores.
地址 Sunnyvale CA US