发明名称 DATA CURATION SYSTEM WITH VERSION CONTROL FOR WORKFLOW STATES AND PROVENANCE
摘要 A data curation system that includes various methods to enable efficient reuse of human and machine effort. To reuse effort, various facilities are presented that model, save, and allow the querying of provenance and state information of a curation workflow and allow for incremental, stateful transitions of the data and the metadata.
申请公布号 US2016048542(A1) 申请公布日期 2016.02.18
申请号 US201414474919 申请日期 2014.09.02
申请人 Tamr, Inc. 发明人 Gluzman Peregrine Vladimir;Ilyas Ihab F.;Stonebraker Michael Ralph;Zdonik Stan;Palmer Andrew H.;Pagan Alexander Richter;Bruckner Daniel Meir;Beskales George;Turmukhametova Aizana;Zhu Tianyu;Kshetri Kanak;Liu Jason;Bates-Haus Nikolaus
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A system for data curation with version control comprising: a system operator; data experts; a curation processes module; a state creation and manipulation module; an update handler module; and a curation states and provenance datastore; wherein said curation processes module is operable to send questions to said data experts andreceive opinions from said data experts about said data curation; wherein said curation processes module is operable to output a curation proposal to said system operator andoutput state changes and provenance to said state creation and manipulation module; wherein said state creation and manipulation module is operable to output candidate changes to said update handler module; wherein said update handler module is operable to output an update proposal to said system operator,input update approvals from said system operator,output final changes to said state creation and manipulation module; wherein said state creation and manipulation module is operable to output new states and provenance to said curation states and provenance datastore,wherein said new states and provenance is a child state of the former parent state, wherein said parent state, said child state, and information about the action that the changed said parent state to said child state are all stored in said curation states and provenance datastore, thereby enabling said version control for said data curation.
地址 Cambridge MA US