Title of invention: System, methods, and data structure for representing object and properties associations
Abstract: Systems, methods, and data structures are disclosed for discovering and representing object-properties associations using textual data. The object can be a physical object or an abstract object. A system comprises a storage medium configured to store a data package containing one or more terms, each comprising a word or phrase that names a property or attribute associated with the object. The terms in the data package can collectively serve as an associative representation of the object. The data package can be used in a system for searching and classifying information based on concepts. The data package can be obtained by analyzing a plurality of text contents, each containing a term defined as the object name and other terms that are not the object name, and then by counting the number of text contents that contain both the object name and a given non-object-name term, or by using a weighting coefficient based on the grammatical roles of the terms, or by using the frequencies of the terms in external documents.
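The counting variant described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the function name `build_data_package`, the regex tokenizer, the stopword set, and the restriction to a single-word object name are all assumptions made for illustration. The grammatical-role weighting and external-frequency variants are not shown.

```python
import re
from collections import Counter


def build_data_package(object_name, text_contents, stopwords=frozenset(), top_n=5):
    """Hypothetical helper: rank non-object terms by the number of text
    contents in which they appear together with the object name."""
    obj = object_name.lower()
    doc_counts = Counter()
    for text in text_contents:
        # Assumption: a simple lowercase word tokenizer stands in for real term extraction.
        terms = set(re.findall(r"[a-z]+", text.lower()))
        if obj not in terms:
            continue  # only text contents that contain the object name are counted
        terms.discard(obj)
        # Each text content contributes at most 1 to a term's count.
        doc_counts.update(terms - stopwords)
    # Sort by descending count, then alphabetically for a stable order.
    ranked = sorted(doc_counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return {"object": object_name, "properties": ranked[:top_n]}


texts = [
    "The computer has a CPU and memory.",
    "A computer needs memory.",
    "The dog barks.",
    "Every computer has a CPU.",
]
stop = frozenset({"the", "a", "and", "has", "needs", "every"})
package = build_data_package("computer", texts, stopwords=stop)
print(package)
```

In this toy input, the text about the dog is ignored because it never mentions the object name, so only terms that co-occur with "computer" enter the data package.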
Publication number: US9183274(B1) Publication date: 2015.11.10
Application number: US201313763716 Filing date: 2013.02.10
Applicant: Inventor: Zhang Guangsheng
Classification: G06F17/27;G06F17/30 Main classification: G06F17/27
Agency: Agent:
Main claim: 1. A method implemented on a computing device comprising one or more processors and for producing a dataset representing associations between terms or objects, the method comprising: receiving, by the computing device, a first term comprising a word or a phrase, wherein the function of the first term includes representing an object or the name of an object, wherein the object comprises at least a physical object or a conceptual object; associating, by the computing device, a first dataset with the first term, wherein the first dataset comprises at least two terms each comprising a word or phrase, wherein the function of the at least two terms includes representing the names of at least two properties or attributes that are associated with the object, or representing at least two terms that are related to the first term, wherein the at least two terms are obtained from a plurality of text contents using a machine-based algorithm, wherein at least some of the text contents are not manually labeled as being related to the object or to the first term; and outputting, by the computing device, a second dataset comprising at least the first term and the first dataset, wherein the function of the second dataset includes at least providing an associative representation of the object, or a representation of the information associated with the object, or a representation of the first term by other terms associated with the first term.
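As a rough illustration of the data flow in the claim (receive a first term, associate a first dataset with it, output a second dataset containing both), the sketch below uses a small immutable record. The class name `SecondDataset`, the function `associate_and_output`, and the field names are hypothetical and are not taken from the patent; the property terms are assumed to have been produced already by some machine-based algorithm.

```python
from dataclasses import dataclass, asdict
from typing import Tuple


@dataclass(frozen=True)
class SecondDataset:
    """Hypothetical container: the first term plus its associated first dataset."""
    first_term: str
    first_dataset: Tuple[str, ...]


def associate_and_output(first_term: str, property_terms) -> SecondDataset:
    """Associate property terms with the first term and return the combined record."""
    terms = tuple(property_terms)
    # The claim requires the first dataset to comprise at least two terms.
    if len(terms) < 2:
        raise ValueError("first dataset must contain at least two terms")
    return SecondDataset(first_term=first_term, first_dataset=terms)


record = associate_and_output("computer", ["cpu", "memory", "keyboard"])
print(asdict(record))
```

A frozen dataclass is used here only so that the output record cannot be modified after it is produced; any serializable mapping would serve the same illustrative purpose.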
Address: