发明名称 DOCUMENT RETRIEVAL/IDENTIFICATION USING TOPICS
摘要 A system for retrieving/identifying a document comprising text stored in a document repository is described. A memory stores a graphical structure comprising a first plurality of nodes each representing a person, and a second plurality of nodes each representing a document in the document repository, the nodes being connected by edges according to automatically observed interactions between the represented people and documents. At least some of the nodes have one or more annotations each denoting a topic. A node relatedness calculator computes distances between nodes of the graphical structure using the topic annotations. An input receives an identifier of a user who is represented by one of the first plurality of nodes. An identifier/retriever identifies one or more documents from the document repository by using the identifier and using the computed distances between nodes.
申请公布号 US2016232157(A1) 申请公布日期 2016.08.11
申请号 US201514615156 申请日期 2015.02.05
申请人 Microsoft Technology Licensing, LLC 发明人 Mansour Riham Hassan Abdel-Moneim;Ashour Ahmed Adel Mohamed Abdel Kader;Abdelwahab El Baz Hesham Saad Mohamed
分类号 G06F17/30;G06K9/62;G06K9/00 主分类号 G06F17/30
代理机构 代理人
主权项 1. A system for retrieving/identifying a document comprising text stored in a document repository comprising: a memory storing a graphical structure comprising a first plurality of nodes each representing a person, and a second plurality of nodes each representing a document in the document repository, the nodes being connected by edges according to automatically observed interactions between the represented people and documents, at least some of the nodes having one or more annotations each denoting a topic, an interaction of the interactions at least partially based on at least one of: a consumption activity by a person represented by a first node of the first plurality of nodes of a document represented by a first node of the second plurality of nodes, ora relationship between a first person represented by the first node of the first plurality of nodes, and a second person represented by a second node of the first plurality of nodes; a node relatedness calculator arranged to compute distances between nodes of the graphical structure using the topic annotations; an input arranged to receive at least an identifier of a user who is represented by one of the first plurality of nodes; and an identifier/retriever arranged to identify one or more documents from the document repository by using the identifier and using the computed distances between nodes.
地址 Redmond WA US