发明名称 System and method for classifying, publishing, searching and locating electronic documents
摘要 The present invention provides an electronic documents management system and method for classifying, publishing, searching and locating electronic documents. Electronic documents are classified and published via an ontological description consisting of at least one vector, each vector comprising at least one feature-value pair. Each vector's slot corresponds to a feature and the vector's range of each slot corresponds to the set of all possible values of each feature. In order to classify an electronic document, two hashing functions are applied. A first hashing function maps each feature to a slot number, corresponding to a coordinate in the appropriate vector. A second hashing function maps the value of each pair to a numeric value of a slot, corresponding to the range of each coordinate. The result of the two hashing functions is an ordered vector that can be mapped to a node in hypercube.
申请公布号 US8799289(B2) 申请公布日期 2014.08.05
申请号 US200511661860 申请日期 2005.09.01
申请人 Carmel-Haifa University Economic Corp. Ltd. 发明人 Ben Asher Yosef;Berkovsky Shlomo
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 Browdy and Neimark, PLLC 代理人 Browdy and Neimark, PLLC
主权项 1. A computerized system for classifying, publishing, searching and locating electronic documents, said system comprising: (i) means for classifying and publishing electronic documents via an ontological description consisting of at least one vector, each vector comprising at least one feature-value pair wherein each slot of said at least one vector corresponds to a feature of said at least one feature-value pair and a range of each of said slots corresponds to a set of all possible values of said feature; (ii) means for storing each of said electronic documents in an electronic storage comprising: means for using a first hashing function to map said feature of each feature-value pair to a slot number, corresponding to a coordinate in said at least one vector;means for using a second hashing function to map said value of each feature-value pair to a numeric value of said slot corresponding to the range of said coordinate;means for measuring distances in a hypercube between two vectors by using said second hashing function to order the feature of each vector and then using said first hashing function to order the values of each vector and then applying a threshold operation on the vectors to measure their distance;means for creating a new ordered vector based on the results of said two hashing functions and the calculated distances; andmeans for mapping said new ordered vector to a node in a hypercube based on calculated distances; (iii) means for storing each of said electronic documents in a hypercube-like graph structure wherein each vertex of said hypercube can be recursively constructed of another hypercube; (iv) means for specifying search criteria for one or more electronic documents via an ontological description by enumerating at least one feature-value pair; (i) means for locating said one or more electronic documents according to the specified search criteria; and(ii) means users of said system to dynamically add and search new search criteria for electronic documents, said new search criteria comprising new features not previously defined by the system.
地址 Mount Carmel IL