Title of invention METHOD AND SYSTEM FOR GENERATING BASIC DATA FOR JUDGING SIMILAR ELECTRONIC DOCUMENT
Abstract PURPOSE: A method and a system for generating basic data for judging similar electronic documents are provided so that a computer can detect similar electronic documents even when their contents differ slightly from one another. CONSTITUTION: A receiver (110) receives an electronic document. A token extractor (120) extracts tokens by dividing the contents of the received electronic document into predetermined units. A token frequency calculator (130) calculates the frequency of each token extracted from the electronic document. A basic data generator (140) generates the basic data by removing low-frequency tokens and then reducing the electronic document to a predetermined size.
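The four components named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not specify the token unit, the frequency threshold, or the target size, so the whitespace-delimited word unit and the hypothetical parameters `min_count` and `max_tokens` are assumptions made only for illustration, as are the function names.

```python
from collections import Counter


def extract_tokens(text):
    """Token extractor (120): split the document into units.

    Assumption: the 'predetermined unit' is a lowercased,
    whitespace-delimited word. The abstract does not say which unit is used.
    """
    return text.lower().split()


def token_frequencies(tokens):
    """Token frequency calculator (130): count occurrences of each token."""
    return Counter(tokens)


def generate_basic_data(text, min_count=2, max_tokens=5):
    """Basic data generator (140): drop rare tokens, then cap the size.

    min_count and max_tokens are hypothetical parameters standing in for
    the 'low frequency' cutoff and the 'predetermined size'.
    """
    freq = token_frequencies(extract_tokens(text))
    # Remove low-frequency tokens.
    kept = {tok: n for tok, n in freq.items() if n >= min_count}
    # Rank by descending frequency, breaking ties alphabetically so the
    # result is deterministic, then truncate to the target size.
    ranked = sorted(kept.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:max_tokens]


if __name__ == "__main__":
    doc_a = "buy cheap pills buy cheap pills today"
    doc_b = "buy cheap pills buy cheap pills now!!"
    # The one-off tokens ("today", "now!!") fall below min_count, so the
    # two slightly different documents reduce to the same basic data.
    print(generate_basic_data(doc_a))
    print(generate_basic_data(doc_a) == generate_basic_data(doc_b))
```

Under these assumptions, two documents that differ only in rarely occurring tokens reduce to identical basic data, which is the kind of small difference the abstract says the method is meant to tolerate. How the basic data is then compared is not described in the abstract and is left out of the sketch.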
Publication number KR20040011769(A) Publication date 2004.02.11
Application number KR20020044880 Filing date 2002.07.30
Applicant MOBIGEN, INC. Inventor KIM, HYEONG GEUN
Classification (IPC1-7):G06F17/60 Main classification (IPC1-7):G06F17/60
Agency Agent
Main claim
Address