发明名称 METHOD AND SYSTEM FOR THE COLLECTION, DESCRIPTION, ORGANIZING AND HANDLING OF DIGITAL INTERNETIC DOCUMENTS BASED ON THEMATIC ONTOLOGY-DERIVED MEANINGS
摘要 he system disclosed herein is composed of 4 subsystems: the Thematic Crawler (TC), the Information Miner (IM), the Document Organizer (DO), the Database (DB) for the storage of the information, and the knowledge database (WordNet-WN) all installed on the same or different computers interconnected on a local network. The system is installed on an Internet connected computer and allows the automatic collection of thematic documents from the Internet by using ontologies and the hyperlinks interconnecting the Internet documents. At the entry of a thematic ontology O1, the information-collecting subsystem exports a collection of internetic documents. In the sequel, the export and information enrichment subsystem exports the most significant words of each document and sets them in correspondence to the ontology meanings. The document-organizing subsystem creates automatically batch documents, which are classified in non-predefined categories with respect to the similarity of the meanings of same documents. The information produced by the two subsystems IN & DO is stored on the database (DB). The query-answering subsystem (Query Manager-QM) receives at its entry queries (Q3) and answers them by supplying documents with meanings similarto those of the query. The documents are presented in a classified batch form (R3).
申请公布号 GR1004662(B) 申请公布日期 2004.08.25
申请号 GR20030100216 申请日期 2003.05.13
申请人 VARLAMIS IRAKLIS;VAZIRGIANNIS MICHAIL;CHALKIDI MARIA;NGUYEN BENJAMIN 发明人 VARLAMIS IRAKLIS;VAZIRGIANNIS MICHAIL;CHALKIDI MARIA;NGUYEN BENJAMIN
分类号 (IPC1-7):G06F17/60 主分类号 (IPC1-7):G06F17/60
代理机构 代理人
主权项
地址