Title of invention: Document categorisation system
Abstract: A document categorisation system, including a clusterer for generating clusters of related electronic documents based on features extracted from said documents, and a filter module for generating a filter on the basis of said clusters to categorise further documents received by said system. The system may include an editor for manually browsing and modifying the clusters. The categorisation of the documents is based on n-grams, which are used to determine significant features of the documents. The system includes a trend analyzer for determining trends of changing document categories over time, and for identifying novel clusters. The system may be implemented as a plug-in module for a spreadsheet application, providing a convenient means for one-off or ongoing analysis of text entries in a worksheet.
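The pipeline outlined in the abstract (n-gram features, a clusterer, and a filter derived from the clusters) can be illustrated with a minimal sketch. It is not the patented method: the abstract does not specify the n-gram type, the similarity measure, or the clustering algorithm, so character trigrams, cosine similarity, greedy single-pass clustering, and the 0.3 threshold are all assumptions made for illustration. The function names `char_ngrams`, `cosine`, `cluster`, and `make_filter` are hypothetical.

```python
from collections import Counter
from math import sqrt


def char_ngrams(text, n=3):
    """Count overlapping character n-grams (assumed feature type) in normalised text."""
    cleaned = " ".join(text.lower().split())
    return Counter(cleaned[i:i + n] for i in range(len(cleaned) - n + 1))


def cosine(a, b):
    """Cosine similarity between two sparse n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster(documents, threshold=0.3, n=3):
    """Greedy single-pass clustering (an assumed stand-in for the clusterer).

    Each document joins the most similar existing cluster if the similarity
    reaches the threshold; otherwise it starts a new cluster.
    """
    clusters = []  # each entry: {"centroid": Counter, "members": [str, ...]}
    for doc in documents:
        features = char_ngrams(doc, n)
        best, best_score = None, threshold
        for candidate in clusters:
            score = cosine(features, candidate["centroid"])
            if score >= best_score:
                best, best_score = candidate, score
        if best is None:
            clusters.append({"centroid": Counter(features), "members": [doc]})
        else:
            best["centroid"].update(features)  # accumulate n-gram counts
            best["members"].append(doc)
    return clusters


def make_filter(clusters, threshold=0.3, n=3):
    """Build a categoriser from the clusters (an assumed stand-in for the filter module).

    The returned function maps a new document to a cluster index, or to None
    when no cluster is similar enough.
    """
    centroids = [c["centroid"] for c in clusters]

    def categorise(doc):
        features = char_ngrams(doc, n)
        scores = [cosine(features, centroid) for centroid in centroids]
        if not scores:
            return None
        best = max(range(len(scores)), key=scores.__getitem__)
        return best if scores[best] >= threshold else None

    return categorise
```

A `None` result from `categorise` marks a document that fits no existing cluster. In this sketch that is only a crude proxy for the novel-cluster detection attributed to the trend analyzer, which the abstract does not describe in detail.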
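Publication number: US2006089924(A1) Publication date: 2006.04.27
Application number: US20050514470 Filing date: 2005.08.24
Applicant: RASKUTTI BHAVANI;KOWALCZYK ADAM Inventor: RASKUTTI BHAVANI;KOWALCZYK ADAM
Classification: G06F17/30;A63F9/08 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address: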