发明名称 |
Systems and methods for information extraction using contextual pattern discovery |
摘要 |
Described herein are methods, systems, apparatuses and products for automatically discovering patterns in a text corpus. An aspect provides extracting at least one context string related to at least one annotator from the at least one text corpus; analyzing the at least one context string for at least one sequence, the at least one sequence comprised of at least one subsequence; determining at least one sequence signature for each at least one sequence by applying applicable rules to the at least one sequence; and grouping the at least one sequence signature into at least one group. |
申请公布号 |
US8630989(B2) |
申请公布日期 |
2014.01.14 |
申请号 |
US201113117570 |
申请日期 |
2011.05.27 |
申请人 |
BLOHM SEBASTIAN JOHANNES;CHU VIVIAN YAW-WEN;HO CHING-TIEN;LI YUNYAO;ZHU HUAIYU;INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
BLOHM SEBASTIAN JOHANNES;CHU VIVIAN YAW-WEN;HO CHING-TIEN;LI YUNYAO;ZHU HUAIYU |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|