发明名称
摘要 PROBLEM TO BE SOLVED: To extract the feature quantity of a document and determine similarity at a very high speed. SOLUTION: A document-processing device is equipped with: a data-inputting part 101 for inputting document data to be processed; a morpheme analyzing part 102 which divides a document into morphemes; a statistic data analyzing part 103 which acquires the frequencies of the respective morphemes in the document; a waveform converting part 104 which associates the respective morphemes with frequencies and also associates the frequencies of morphemes with intensities to generate frequency spectrum data which correspond to the document data; a DA conversion part 105 which converts the frequency spectrum data to waveform data; a pass filter part 106 which filters the waveform data by means of specific processing; and a resonance filter part 107 which extracts similar waveform data from among a plurality of waveform data after filtering. COPYRIGHT: (C)2011,JPO&INPIT
申请公布号 JP5488808(B2) 申请公布日期 2014.05.14
申请号 JP20100014264 申请日期 2010.01.26
申请人 发明人
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址