发明名称 Dynamically building an unstructured information management architecture (UIMA) pipeline
摘要 A pipeline development environment includes a toolset that includes a visual design editor. The editor comprises a display interface having a palette of known Annotators that may be selected by a developer. The pipeline development environment also includes or has associated therewith a data repository. The data repository stores datasets. A particular dataset is associated with an Annotator and comprises dependency data generated from execution of a pipeline (or some portion thereof). The repository typically stores datasets from many pipeline runs, including runs of other pipelines, multiple runs of a given pipeline with different inputs, etc. Using the editor, a developer creates a visual representation of the pipeline. As Annotators are added into the pipeline, system tooling dynamically generates the descriptor files and other configuration parameters (for the new pipeline), preferably based on the dependency data associated with the individual Annotators and retrieved from the repository.
申请公布号 US9280340(B2) 申请公布日期 2016.03.08
申请号 US201414242401 申请日期 2014.04.01
申请人 International Business Machines Corporation 发明人 O'Keeffe William Graham;Karle Christopher James;Taieb David Deidou
分类号 G06F9/44 主分类号 G06F9/44
代理机构 代理人 Sarbakhsh Reza;Judson David H.
主权项 1. A method of building a software system pipeline comprising a set of elements, comprising: storing dependency data for one or more elements, the dependency data for at least a particular element having been derived from a data model generated as a result of executing at least one other pipeline in which the particular element was included; identifying a first element; associating a second element to the first element to form a portion of the software system pipeline, wherein the particular element is one of the first and second elements; retrieving dependency data for at least one of the first and second elements, at least some of the retrieved dependency data being dependency data that was derived from the data model; and automatically generating descriptor data for the software system pipeline based on the retrieved dependency data; wherein at least one of the steps is carried out in software executing in a hardware element.
地址 Armonk NY US