Title: Extensible surface for consuming information extraction services
Abstract: Representing structured data extracted from unstructured data in a fashion that allows querying using relational database concepts. A method includes receiving user input specifying one or more database views. The method further includes receiving user input specifying an information extraction technique, such as an extraction workflow. The method further includes receiving user input specifying a corpus of data. The extraction technique is applied to the corpus of data to produce the one or more database views. These views can then be queried or operated on using database tools.
Publication number: US9064004(B2) Publication date: 2015.06.23
Application number: US201113040939 Filing date: 2011.03.04
Applicant: Microsoft Technology Licensing, LLC Inventor: DeRose Pedro Dantas
Classification: G06F17/30; G06F17/27 Main classification: G06F17/30
Agency: Agents: Haslam Brian; Hoff Aaron; Minhas Micky
Main claim: 1. In a computing environment, a method of representing structured data extracted from unstructured data in a fashion which allows querying using relational database concepts, the method comprising: receiving user input specifying one or more database views; receiving user input specifying an information extraction technique, the information extraction technique defining how to extract structured data from unstructured data and the information extraction technique comprising a phrase semantic extraction technique which determines a semantic relationship about one or more words based upon a semantic environment of the one or more words; receiving user input specifying a corpus of data comprising unstructured data, the unstructured data comprising data that is not organized semantically such that it does not have a formalized type and is not in a formal entity level relationship; and applying the extraction technique to the corpus of data to extract structured data from the unstructured data of the corpus of data and to produce the one or more database views including the extracted structured data.
Address: Redmond, WA, US
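
The four steps of the main claim (a view specification, an extraction technique, a corpus, and applying the technique to produce queryable views) can be illustrated with a minimal sketch. Nothing below comes from the patent itself: the names `extract_employment` and `build_view`, the sample corpus, and the use of SQLite are all assumptions chosen for illustration, and the regular expression is only a toy stand-in for the phrase semantic extraction technique the claim describes.

```python
import re
import sqlite3

# Hypothetical toy extractor: finds "<Person> works at <Company>" phrases.
# A simple regex stands in for a real phrase semantic extraction technique.
WORKS_AT = re.compile(r"([A-Z][a-z]+) works at ([A-Z][A-Za-z]+)")


def extract_employment(document):
    """Return (person, company) pairs found in one unstructured text document."""
    return WORKS_AT.findall(document)


def build_view(conn, view_name, extractor, corpus):
    """Apply the extractor to the corpus and expose the result as a database view."""
    if not view_name.isidentifier():
        raise ValueError("view name must be a plain identifier")
    table = view_name + "_data"  # backing table for the view (illustrative naming)
    conn.execute(f"CREATE TABLE {table} (person TEXT, company TEXT)")
    for document in corpus:
        conn.executemany(f"INSERT INTO {table} VALUES (?, ?)", extractor(document))
    conn.execute(f"CREATE VIEW {view_name} AS SELECT person, company FROM {table}")


# User-specified corpus of unstructured text (made-up sample data).
corpus = [
    "Alice works at Contoso. She joined last year.",
    "After college, Bob works at Fabrikam as an analyst.",
]

conn = sqlite3.connect(":memory:")
build_view(conn, "employment", extract_employment, corpus)

# The view can now be queried with ordinary relational tools.
rows = conn.execute(
    "SELECT person, company FROM employment ORDER BY person"
).fetchall()
print(rows)
```

In this sketch the view name, the extractor, and the corpus are passed in as three separate inputs, mirroring the three user inputs recited in the claim; a different extractor could be substituted without changing how the resulting view is queried.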