摘要 |
According to an embodiment, a data processing device includes an extractor, a generator, and a constructor. The extractor is configured to extract, from a document having been subjected to predicate argument structure analysis and anaphora resolution, an element sequence including elements each being a combination of predicate having a shared argument and case type information of the shared argument, together with the shared argument. The generator is configured to produce case example data expressed by a feature vector for each attention element which is one of the elements. The feature vector includes feature value(s) about a sub-sequence having the attention element and feature value(s) about a sequence of the shared argument corresponding to the sub-sequence. The constructor is configured to construct a script model for estimating the elements each following antecedent context by performing machine learning based on a discriminative model using the case example data. |