Title of invention MATCHING ENGINE WITH SIGNATURE GENERATION AND RELEVANCE DETECTION
Abstract A system and a method generate at least one signature associated with a document. In one embodiment, a document comprising text is received and parsed to generate a token set that includes a plurality of tokens. Each token corresponds to text in the document that is separated by a predefined character characteristic. A score is calculated for each token in the token set based on the frequency and distribution of the text in the document. Each token is then ranked based on its calculated score. A subset of the ranked tokens is selected, and a signature is generated for each occurrence of the selected tokens. The resulting list of signatures is then output.
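The steps named in the abstract (parse, score, rank, select, sign) can be sketched in Python as follows. This is only an illustrative sketch, not the method claimed in the application: the abstract does not give the delimiter rule, the scoring formula, or the signature function, so the alphanumeric tokenizer, the frequency-times-spread score, the SHA-1 hash over a fixed character window, and the names `tokenize`, `score_tokens`, `generate_signatures`, `top_k`, and `window` are all assumptions made for the example.

```python
import hashlib
import re
from collections import defaultdict


def tokenize(text):
    """Split on non-alphanumeric characters (assumed delimiter rule).

    Returns (lowercased token, start offset) pairs in document order.
    """
    return [(m.group().lower(), m.start())
            for m in re.finditer(r"[A-Za-z0-9]+", text)]


def score_tokens(tokens, doc_length):
    """Assumed score: occurrence count weighted by how widely the token is spread."""
    positions = defaultdict(list)
    for tok, pos in tokens:
        positions[tok].append(pos)
    scores = {}
    for tok, offsets in positions.items():
        frequency = len(offsets)
        # Fraction of the document between the first and last occurrence.
        spread = (offsets[-1] - offsets[0]) / doc_length if doc_length else 0.0
        scores[tok] = frequency * (1.0 + spread)
    return scores, positions


def generate_signatures(text, top_k=3, window=20):
    """Return one (token, offset, digest) signature per occurrence of each selected token."""
    tokens = tokenize(text)
    scores, positions = score_tokens(tokens, len(text))
    # Rank by descending score; ties are broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    selected = ranked[:top_k]
    signatures = []
    for tok in selected:
        for pos in positions[tok]:
            # Assumed signature: SHA-1 of the text window starting at the occurrence.
            context = text[pos:pos + window]
            digest = hashlib.sha1(context.encode("utf-8")).hexdigest()
            signatures.append((tok, pos, digest))
    return signatures


if __name__ == "__main__":
    for sig in generate_signatures("alpha beta alpha gamma beta alpha", top_k=2):
        print(sig)
```

Hashing a window of text at each occurrence, rather than the token alone, makes each signature depend on local context, so two documents produce matching signatures only where the surrounding text also agrees. That is one plausible design choice among several; the application may define signatures differently.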
Publication number WO2006122086(A2) Publication date 2006.11.16
Application number WO2006US17846 Filing date 2006.05.08
Applicant DGATE TECHNOLOGIES, INC.;REN, LIWEI;TAN, DEHUA;HUANG, FEI;HUANG, SHU;DONG, AIGUO Inventor REN, LIWEI;TAN, DEHUA;HUANG, FEI;HUANG, SHU;DONG, AIGUO
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address