Title of invention: DATA PROCESSING DEVICE AND SCRIPT MODEL CONSTRUCTION METHOD
Abstract: According to an embodiment, a data processing device includes an extractor, a generator, and a constructor. The extractor is configured to extract, from a document that has been subjected to predicate argument structure analysis and anaphora resolution, an element sequence together with a shared argument, the element sequence including elements that are each a combination of a predicate having the shared argument and case type information of the shared argument. The generator is configured to produce case example data expressed by a feature vector for each attention element, which is one of the elements. The feature vector includes one or more feature values about a sub-sequence containing the attention element and one or more feature values about the sequence of the shared argument corresponding to the sub-sequence. The constructor is configured to construct a script model for estimating the element that follows an antecedent context, by performing machine learning based on a discriminative model using the case example data.
Publication number: US2016012040(A1) Publication date: 2016.01.14
Application number: US201514837197 Filing date: 2015.08.27
Applicant: KABUSHIKI KAISHA TOSHIBA; TOSHIBA SOLUTIONS CORPORATION Inventor: Hamada Shinichiro
Classification: G06F17/28; G06K9/00; G06F17/27 Main classification: G06F17/28
Agency: Agent:
Principal claim: 1. A data processing device comprising: an extractor configured to extract, from a document having been subjected to predicate argument structure analysis and anaphora resolution, an element sequence in which a plurality of elements are arranged in order of appearances of predicates in the document, the elements each being a combination of the predicate having a shared argument and case type information indicating a type of a case of the shared argument, together with the shared argument; a case example generator configured to produce case example data expressed by a feature vector for each attention element, the attention element being one of the elements included in the element sequence, the feature vector including at least one of one or more feature values about a sub-sequence having the attention element as a last element of the sub-sequence in the element sequence and one or more feature values about a sequence of the shared argument corresponding to the sub-sequence; and a model constructor configured to construct a script model for estimating the elements each following antecedent context by performing machine learning based on a discriminative model using the case example data.
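The three components recited in the claim (extractor, case example generator, model constructor) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the input format `analyzed`, the function names, the n-gram style feature names, and the choice of a multiclass perceptron as the discriminative model. As a simplification, the attention element is used as the training label and the features are drawn from the elements and shared-argument surface forms that precede it in the sub-sequence.

```python
from collections import Counter, defaultdict

# Hypothetical input: predicates in order of appearance, each with arguments given as
# (case_type, entity_id, surface_form). entity_id stands in for the anaphora resolution result.
analyzed = [
    ("arrest", [("obj", "e1", "the suspect"), ("subj", "e2", "police")]),
    ("charge", [("obj", "e1", "he")]),
    ("convict", [("obj", "e1", "the man")]),
    ("sentence", [("obj", "e1", "him")]),
]

def extract_element_sequences(analyzed, min_len=2):
    """Extractor: group (predicate, case type) elements by shared argument, in document order."""
    by_entity = defaultdict(list)
    for predicate, args in analyzed:
        for case_type, entity_id, surface in args:
            by_entity[entity_id].append(((predicate, case_type), surface))
    # An argument is "shared" only if at least min_len predicates take it.
    return {e: seq for e, seq in by_entity.items() if len(seq) >= min_len}

def make_case_examples(sequence, max_n=2):
    """Case example generator: one (feature vector, label) pair per attention element."""
    examples = []
    for i, (element, _) in enumerate(sequence):
        features = Counter({"bias": 1})
        for n in range(1, max_n + 1):
            if i - n < 0:
                break
            context = sequence[i - n:i]  # the n elements preceding the attention element
            features["elem:" + "|".join(f"{p}/{c}" for (p, c), _ in context)] += 1
            features["arg:" + "|".join(s for _, s in context)] += 1
        examples.append((dict(features), element))
    return examples

def train_perceptron(examples, epochs=10):
    """Model constructor: a multiclass perceptron, used here as a stand-in discriminative model."""
    weights = defaultdict(float)  # keyed by (label, feature name)
    labels = sorted({label for _, label in examples})

    def predict(features):
        return max(labels, key=lambda y: sum(weights[(y, f)] * v for f, v in features.items()))

    for _ in range(epochs):
        for features, gold in examples:
            guess = predict(features)
            if guess != gold:  # standard perceptron update on a mistake
                for f, v in features.items():
                    weights[(gold, f)] += v
                    weights[(guess, f)] -= v
    return predict

sequences = extract_element_sequences(analyzed)
examples = make_case_examples(sequences["e1"])
predict = train_perceptron(examples)
print(predict(examples[1][0]))  # estimated element following the context "arrest/obj"
```

In this toy input only entity `e1` is shared by more than one predicate, so a single element sequence is extracted. A practical system would train on sequences from many documents and would likely use a library-provided discriminative learner such as logistic regression in place of the hand-written perceptron.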
Address: Tokyo JP