发明名称 連続引用判定装置及び方法
摘要 PROBLEM TO BE SOLVED: To detect that a plurality of documents are continuously excerpted.SOLUTION: The present invention comprises: cutting a character string segment out of one sentence of an input document; determining a start point of the character string segment; making a digest, in which a character string corresponding to a character string segment for each predetermined number of characters from the start point has been converted into a hash function, slide by a predetermined number of characters, and storing a document ID and a digest group of the digest in a digest DB; reading out the digest from the digest DB; and determining that a plurality of documents are continuously excerpted, when excerpting, in a character string segment separated by a predetermined window size w of the digest, a document having a different digest.
申请公布号 JP5906229(B2) 申请公布日期 2016.04.20
申请号 JP20130229439 申请日期 2013.11.05
申请人 日本電信電話株式会社 发明人 船越 要;鷲崎 誠司
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址