发明名称 EXTENSIBLE SYSTEM AND METHOD FOR INFORMATION EXTRACTION IN A DATA PROCESSING SYSTEM
摘要 A data mashup system having information extraction capabilities for receiving multiple streams of textual data, at least one of which contains unstructured textual data. A repository stores annotators that describe how to analyze the streams of textual data for specified unstructured data components. The annotators are applied to the data streams to identify and extract the specified data components according to the annotators. The extracted data components are tagged to generate structured data components and the specified unstructured data components in the input data streams are replaced with the tagged data components. The system then combines the tagged data from the multiple streams to form a mashup output data stream.
申请公布号 US2011295853(A1) 申请公布日期 2011.12.01
申请号 US20100788142 申请日期 2010.05.26
申请人 LI YUNYAO;REISS FREDERICK R.;SIMMEN DAVID E.;THALAMATI SURESH;INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 LI YUNYAO;REISS FREDERICK R.;SIMMEN DAVID E.;THALAMATI SURESH
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址