发明名称 Predicting and Enhancing Document Ingestion Time
摘要 A mechanism is provided in a data processing system for predicting and enhancing ingestion time for a set of input documents. The mechanism receives a set of documents to be added to a corpus of the data processing system. The mechanism records document features of each document within the set of documents using an annotation engine within the data processing system. The mechanism predicts an ingestion time for each document within the set of documents based on the document characteristics and a machine learning model. The mechanism assigns the set of documents to data processing system resources to be processed based on the predicted ingestion time for each document.
申请公布号 US2015317561(A1) 申请公布日期 2015.11.05
申请号 US201414266959 申请日期 2014.05.01
申请人 International Business Machines Corporation 发明人 Allen Corville O.;Freed Andrew R.
分类号 G06N5/04;G06N99/00;G06F17/30 主分类号 G06N5/04
代理机构 代理人
主权项 1. A method, in a data processing system, for predicting and enhancing ingestion time for a set of input documents, the method comprising: receiving a set of documents to be added to a corpus of documents; recording document features of each document within the set of documents using an annotation engine within the data processing system; predicting an ingestion time for each document within the set of documents based on the document characteristics and a machine learning model; and assigning the set of documents to data processing system resources to be processed based on the predicted ingestion time for each document.
地址 Armonk NY US