发明名称 Recognizer of text-based work
摘要 Described herein is a technology for recognizing the content of text documents. The technology determines one or more hash values for the content of a text document. Alternatively, the technology may generate a "sifted text" version of a document. In one implementation described herein, document recognition is used to determine whether the content of one document is copied (i.e., plagiarized) from another document. This is done by comparing hash values of documents (or alternatively their sifted text). In another implementation described herein, document recognition is used to categorize the content of a document so that it may be grouped with other documents in the same category. This abstract itself is not intended to limit the scope of this patent. The scope of the present invention is pointed out in the appending claims.
申请公布号 US2002172425(A1) 申请公布日期 2002.11.21
申请号 US20010843255 申请日期 2001.04.24
申请人 VENKATESAN RAMARATHNAM;MALKIN MICHAEL 发明人 VENKATESAN RAMARATHNAM;MALKIN MICHAEL
分类号 G06F17/22;G06F17/27;(IPC1-7):G06K9/72 主分类号 G06F17/22
代理机构 代理人
主权项
地址