Title of invention Identification of semantic relationships within reported speech
Abstract Methods and computer-readable media for associating words or groups of words distilled from content, such as reported speech or an attitude report, of a document to form semantic relationships collectively used to generate a semantic representation of the content are provided. Semantic representations may include elements identified or parsed from a text portion of the content, the elements of which may be associated with other elements that share a semantic relationship, such as an agent, location, or topic relationship. Relationships may also be developed by associating one element that is in relation to, or is about, another element, thereby allowing for rapid and effective comparison of associations found in a semantic representation with associations derived from queries. The semantic relationships may be determined based on semantic information, such as potential meanings and grammatical functions of each element within the text portion of the content.
Publication number US8868562(B2) Publication date 2014.10.21
Application number US200812201675 Filing date 2008.08.29
Applicant Microsoft Corporation Inventors Crouch Richard S.;Van Den Berg Martin Henk;Ahn David;Gurevich Olga;Pell Barney D.;Polanyi Livia;Prevost Scott A.;Thione Giovanni Lorenzo
Classification G06F7/00;G06F17/30 Main classification G06F7/00
Agency Agent Ream Dave;Taylor Peter;Minhas Micky
Principal claim 1. A computer-implemented method for developing semantic relationships between elements distilled from content of a document to generate a semantic representation of the content, the method comprising: identifying, by way of a computing device having a processor and memory, a text portion of the document; determining semantic information for a plurality of elements identified in the text portion, the semantic information including one or more of meanings of the identified elements or grammatical functions of the identified elements; identifying at least one of the identified elements as a subject of the text portion; determining a plurality of levels of association from the text portion and identifying at least one of the identified elements as a reporting act corresponding to an attitude report for each of the plurality of levels of association, the reporting act identified based on a set of rules that utilizes, in part, surrounding text, wherein the attitude report describes the subject's attitude toward a particular topic of the text portion; based on the determined semantic information for the identified elements, associating the identified elements so that each association of identified elements represents a certain semantic relationship; generating, by way of the computing device, a semantic representation that represents the associations, by way of relational elements that describe the associations, of the identified elements to one another; and indexing the semantic representation, including the identified elements and the relational elements, in an index for retrieval, the index being searchable and including pointers from the semantic representation to its associated text portion.
Address Redmond WA US
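
Illustrative sketch (not part of the patent record). The steps recited in claim 1 can be pictured with a toy example: find reporting acts with a small rule set that looks at the surrounding text, associate each one with a subject and a topic at its level of association, and index the resulting relations with pointers back to the text portion. Everything named here (`REPORTING_VERBS`, `Relation`, `represent`, `SemanticIndex`) is hypothetical and chosen only for illustration; the whitespace tokenization and the "previous token is the subject" rule are simplifying assumptions, not the method the patent describes.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical rule set: verbs that signal a reporting act (an assumption,
# not the rule set of the patent).
REPORTING_VERBS = {"said", "believes", "claimed", "thinks"}


@dataclass(frozen=True)
class Relation:
    role: str       # relational element, e.g. "agent" or "topic"
    head: str       # the reporting act
    dependent: str  # the element associated with the reporting act
    level: int      # level of association (nesting depth of the report)


def represent(text):
    """Build a toy semantic representation of nested reported speech."""
    tokens = text.rstrip(".").split()
    relations = []
    level = 0
    for i, token in enumerate(tokens):
        if i == 0 or token.lower() not in REPORTING_VERBS:
            continue
        # Surrounding text supplies the subject (the previous token) and
        # the topic (everything after the verb, minus a leading "that").
        rest = tokens[i + 1:]
        if rest and rest[0].lower() == "that":
            rest = rest[1:]
        relations.append(Relation("agent", token, tokens[i - 1], level))
        relations.append(Relation("topic", token, " ".join(rest), level))
        level += 1
    return relations


class SemanticIndex:
    """Searchable index from relations to the text portions they came from."""

    def __init__(self):
        self._postings = defaultdict(set)  # relation key -> document ids
        self._texts = {}                   # document id -> text portion

    def add(self, doc_id, text):
        self._texts[doc_id] = text
        for rel in represent(text):
            key = (rel.role, rel.head.lower(), rel.dependent.lower())
            self._postings[key].add(doc_id)

    def search(self, role, head, dependent):
        key = (role, head.lower(), dependent.lower())
        doc_ids = self._postings.get(key, set())
        return [self._texts[d] for d in sorted(doc_ids)]
```

For the sentence "John said that Mary believes that taxes rose.", this sketch records "John" as the agent of "said" at level 0 and "Mary" as the agent of "believes" at level 1. A query such as `search("agent", "believes", "Mary")` then returns the original sentence, while `search("agent", "said", "Mary")` returns nothing. Matching on relations rather than bare keywords is the point of the comparison between query-derived and indexed associations that the abstract mentions.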