Title of invention PASSAGE DIVISION METHOD, DEVICE AND PROGRAM
Abstract PROBLEM TO BE SOLVED: Conventional methods have difficulty dividing a document into passages correctly when the document contains a plurality of passages whose sentences have closely related meanings and similar feature quantities. SOLUTION: A passage division device 100, under control of a control unit 101, divides a document input from an input unit 102 into sentences at a sentence division unit 103. A feature quantity calculation unit 104 uses each divided sentence as a query, performs associative retrieval over documents stored beforehand in a corpus unit 111, and acquires a document vector for that sentence. A similarity calculation unit 105 finds the two document vectors with the maximum similarity, and when that similarity is equal to or larger than a prescribed threshold, a retrieval query generation unit 106 consolidates the two corresponding sentences to generate a query as their common element. The feature quantity calculation unit 104 regenerates a document vector by using this query. A feature quantity update unit 107 updates the feature quantity on the basis of its reliability, and the corresponding sentences are connected sequentially into a passage while the feature quantity is updated.
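The merge loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and it rests on several assumptions that are not in the record: plain word counts stand in for the document vectors obtained by associative retrieval from the corpus unit 111, only adjacent segments are compared so that passages stay contiguous, the merged vector is the sum of the two vectors rather than a regenerated common-element query, and the reliability-based update of unit 107 is omitted. The names `split_sentences`, `term_vector`, `cosine`, and `divide_passages`, and the default threshold of 0.3, are hypothetical.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    # Naive sentence splitter (assumption): break after ., ! or ? plus whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def term_vector(text):
    # Stand-in feature quantity (assumption): lowercase word counts.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def divide_passages(text, threshold=0.3):
    sentences = split_sentences(text)
    segments = [[s] for s in sentences]          # each sentence starts as its own segment
    vectors = [term_vector(s) for s in sentences]
    while len(segments) > 1:
        # Similarity of each adjacent pair of segments (adjacency is an assumption).
        sims = [cosine(vectors[i], vectors[i + 1]) for i in range(len(segments) - 1)]
        best = max(range(len(sims)), key=sims.__getitem__)
        if sims[best] < threshold:
            break                                 # no pair reaches the threshold: stop merging
        # Merge the most similar pair and replace its feature vector.
        segments[best:best + 2] = [segments[best] + segments[best + 1]]
        vectors[best:best + 2] = [vectors[best] + vectors[best + 1]]
    return [" ".join(seg) for seg in segments]


if __name__ == "__main__":
    sample = "Cats chase mice. Cats chase birds. Stocks fell sharply. Stocks fell again."
    for passage in divide_passages(sample):
        print(passage)
```

With the sample text, the two sentences about cats and the two about stocks each share two of three terms, so each pair is merged, while the similarity across the topic boundary is zero and stays below the threshold.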
Publication number JP2013222418(A) Publication date 2013.10.28
Application number JP20120095344 Filing date 2012.04.19
Applicant HITACHI LTD Inventors KAKISHITA YASUKI;HATTORI HIDEHARU;MURAKAMI TOMOKAZU;KONICHI OSAMU
Classification G06F17/30;G06F17/21;G06F17/27 Main classification G06F17/30
Agency Agent
Main claim
Address