发明名称 System and method for facts extraction and domain knowledge repository creation from unstructured and semi-structured documents
摘要 Provided are methods and systems that extract facts of unstructured documents and build an oracle for various domains. The present invention addresses the problem of efficient finding and extraction of facts about a particular subject domain from semi-structured and unstructured documents, makes inferences of new facts from the extracted facts and the ways of verification of the facts, thus becoming a source of knowledge about the domain to be effectively queried. The methods and systems can also extract temporal information from unstructured and semi-structured documents, and can find and extract dynamically generated documents from Deep or Dynamic Web.
申请公布号 US7756807(B1) 申请公布日期 2010.07.13
申请号 US20080237059 申请日期 2008.09.24
申请人 GLENNBROOK NETWORKS 发明人 KOMISSARCHIK JULIA;KOMISSARCHIK EDWARD
分类号 G06N5/02 主分类号 G06N5/02
代理机构 代理人
主权项
地址