发明名称 Systems and methods for information extraction using contextual pattern discovery
摘要 Described herein are methods, systems, apparatuses and products for automatically discovering patterns in a text corpus. An aspect provides extracting at least one context string related to at least one annotator from the at least one text corpus; analyzing the at least one context string for at least one sequence, the at least one sequence comprised of at least one subsequence; determining at least one sequence signature for each at least one sequence by applying applicable rules to the at least one sequence; and grouping the at least one sequence signature into at least one group.
申请公布号 US8630989(B2) 申请公布日期 2014.01.14
申请号 US201113117570 申请日期 2011.05.27
申请人 BLOHM SEBASTIAN JOHANNES;CHU VIVIAN YAW-WEN;HO CHING-TIEN;LI YUNYAO;ZHU HUAIYU;INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 BLOHM SEBASTIAN JOHANNES;CHU VIVIAN YAW-WEN;HO CHING-TIEN;LI YUNYAO;ZHU HUAIYU
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址