Title of invention: SYSTEM AND METHOD FOR DETERMINING COPY DOCUMENT USING FREQUENCY WORD AND SYSTEM AND METHOD FOR EXTRACTING FREQUENCY WORD
Abstract: A system and method for determining copy (duplicate) documents using frequent phrases, and a system and method for extracting frequent phrases, are provided. Frequent phrases are found by extracting, from index data that includes the identification (ID) data of documents, the ID data that reaches a preset document frequency. A document frequency determining unit (102) uses the index data to extract the ID data that shows the preset document frequency. An ID data set generator (103) generates ID data for each document in a newly collected document set and then produces an ID data set by excluding the extracted ID data. An index data generator (104) uses the ID data set to produce index data corresponding to the ID data. A copy document determining unit (105) queries the generated index data for documents having overlapping ID data to determine whether duplication exists.
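The four units named in the abstract can be sketched as a small pipeline. This is a minimal illustration and not the patented implementation: it assumes that "ID data" are word 3-grams (the abstract does not say how ID data are derived), that the index is an inverted index from ID data to document IDs, and that two documents count as copies when the share of overlapping ID data reaches a threshold. The function names, `df_threshold`, and `min_overlap` are all hypothetical.

```python
from collections import defaultdict


def shingles(text, n=3):
    """Assumed form of ID data: the set of word n-grams in a document."""
    words = text.lower().split()
    return {" ".join(words[i:i + n]) for i in range(len(words) - n + 1)}


def frequent_ids(index, df_threshold):
    """Unit 102 (sketch): ID data whose document frequency reaches the preset value."""
    return {pid for pid, doc_ids in index.items() if len(doc_ids) >= df_threshold}


def build_index(id_sets):
    """Unit 104 (sketch): inverted index from ID data to the documents containing it."""
    index = defaultdict(set)
    for doc_id, ids in id_sets.items():
        for pid in ids:
            index[pid].add(doc_id)
    return index


def find_duplicates(docs, existing_index, df_threshold=3, min_overlap=0.5):
    """Units 103 and 105 (sketch): drop frequent ID data, then compare documents."""
    frequent = frequent_ids(existing_index, df_threshold)
    # Unit 103: per-document ID data, excluding the frequent ID data.
    id_sets = {doc_id: shingles(text) - frequent for doc_id, text in docs.items()}
    index = build_index(id_sets)

    # Unit 105: count ID data shared by each pair of documents via the index.
    shared = defaultdict(int)
    for doc_ids in index.values():
        ordered = sorted(doc_ids)
        for i, a in enumerate(ordered):
            for b in ordered[i + 1:]:
                shared[(a, b)] += 1

    duplicates = []
    for (a, b), count in shared.items():
        smaller = min(len(id_sets[a]), len(id_sets[b]))
        if smaller and count / smaller >= min_overlap:
            duplicates.append((a, b))
    return sorted(duplicates)


# Hypothetical data: a boilerplate phrase already seen in three indexed documents.
existing_index = {"all rights reserved": {"x1", "x2", "x3"}}
docs = {
    "a": "the quick brown fox jumps over the lazy dog all rights reserved",
    "b": "the quick brown fox jumps over the lazy cat all rights reserved",
    "c": "completely different news story text all rights reserved",
}
print(find_duplicates(docs, existing_index))
```

Excluding the frequent ID data before indexing keeps boilerplate that appears in many documents from counting as evidence of copying, and it keeps the index smaller.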
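Publication number: KR20090095208(A)  Publication date: 2009.09.09
Application number: KR20080020397  Filing date: 2008.03.05
Applicant: NHN CORPORATION  Inventor: SIM, KYU CHEOL
Classification: G06F17/21;G06F17/30  Main classification: G06F17/21
Agency:  Agent:
Main claim:
Address: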