Title TEXT ANALYSIS TECHNIQUES
Abstract One embodiment of the present invention includes means for determining a concept representation for a set of text documents based on partial order analysis and for modifying this representation if it is determined to be unidentifiable. The embodiment further includes means for labeling the representation, mapping documents to it to provide a corresponding document representation, generating a number of document signatures, each of a different type, and performing several data processing applications, each with a different one of these document signatures.
Publication number US2008109454(A1) Publication date 2008.05.08
Application number US20060556437 Filing date 2006.11.03
Applicant WILLSE ALAN R;HETZLER ELIZABETH G;HOPE LAWRENCE L;TANASSE THEODORE E;HAVRE SUSAN L;TURNER ALAN E;MACGREGOR MARGARET;NANCARROW CATHERINE;NAKAMURA GRANT C Inventor WILLSE ALAN R.;HETZLER ELIZABETH G.;HOPE LAWRENCE L.;TANASSE THEODORE E.;HAVRE SUSAN L.;TURNER ALAN E.;MACGREGOR MARGARET;NANCARROW CATHERINE;NAKAMURA GRANT C.
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address