发明名称 SYSTEMS AND METHODS FOR EXTRACTING SPECIFIED DATA FROM NARRATIVE TEXT
摘要 Embodiments are directed to extracting specified data items from narrative text. In one scenario, a computer system accesses narrative textual information which includes data items that are to be identified and extracted. The computer system identifies specified data items in the narrative textual information that are to be extracted from the narrative textual information. The computer system then filters the identified data items to remove false positive identifications. The false positive filtering includes classifying the identified data items as specified data items, so that classified data items are identified as true positive items that are to be extracted from the narrative textual information. The computer system further extracts, from the narrative textual information, those filtered data items that were classified as being true positive items.
申请公布号 US2014350965(A1) 申请公布日期 2014.11.27
申请号 US201414285343 申请日期 2014.05.22
申请人 Meystre Stéphane Michael;Escamez Óscar Ferrández 发明人 Meystre Stéphane Michael;Escamez Óscar Ferrández
分类号 G06F19/00 主分类号 G06F19/00
代理机构 代理人
主权项 1. A computer system comprising the following: one or more processors; system memory; one or more computer-readable storage media having stored thereon computer-executable instructions that, when executed by the one or more processors, causes the computing system to perform a method for extracting specified data items from narrative text, the method comprising the following: accessing one or more portions of narrative textual information, the narrative textual information including one or more data items that are to be identified and extracted;identifying one or more specified data items in the narrative textual information that are to be extracted from the narrative textual information, wherein identifying includes at least one of the following: performing a dictionary-based search, performing a pattern-based search and implementing machine learning to identify the data items that are to be extracted;filtering the identified data items to remove false positive identifications, the false positive filtering including classifying the identified data items as specified data items, such that classified data items are identified as true positive items that are to be extracted from the narrative textual information; andextracting, from the narrative textual information, those filtered data items that were classified as being true positive items.
地址 Sandy UT US