Title of invention: Preprocessing text to enhance statistical features
Abstract: A document preprocessor preprocesses a document to enhance the statistical features of the document. The system preprocesses the document by matching a prefix and a trailing context in the document with one or more matching prefixes in a transformation database, where the prefix is a first string of one or more tokens in the document and the trailing context is a second string of one or more tokens in the document that trails the prefix. Alternatively, the system preprocesses the document by computing cyclic permutations of the document, sorting these permutations, and taking the last token from each of the sorted permutations.
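The alternative described in the abstract (compute the cyclic permutations, sort them, take the last token of each) matches the general shape of a Burrows-Wheeler-style transform. The following is a minimal sketch of that step only, not the patent's claimed implementation. It assumes each character is one token and adds no end-of-text marker; the function name `last_tokens_of_sorted_rotations` is hypothetical.

```python
def last_tokens_of_sorted_rotations(tokens):
    """Return the last token of each cyclic rotation, taken in sorted order.

    `tokens` is a list; treating one character as one token is an
    assumption made for this illustration only.
    """
    n = len(tokens)
    # Build every cyclic permutation (rotation) of the token sequence.
    rotations = [tokens[i:] + tokens[:i] for i in range(n)]
    # Sort the rotations lexicographically, comparing token by token.
    rotations.sort()
    # Keep only the final token of each sorted rotation.
    return [rotation[-1] for rotation in rotations]


result = last_tokens_of_sorted_rotations(list("banana"))
print("".join(result))  # prints "nnbaaa"
```

Building all rotations explicitly takes quadratic memory, which is acceptable for a short illustration. The output groups identical tokens into runs ("nn", "aaa"), which is the kind of statistical regularity a later compression stage can exploit.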
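Publication number: US8527500(B2) Publication date: 2013.09.03
Application number: US20090395319 Filing date: 2009.02.27
Applicant: SCHNEIDER JAMES PAUL; RED HAT, INC. Inventor: SCHNEIDER JAMES PAUL
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address: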