发明名称 System and method for language extraction and encoding
摘要 A computerized method for extracting information from natural-language text data includes parsing the text data to determine the grammatical structure of the text data and regularizing the parsed text data to form structured word terms. The parsing step, which can be performed in one or more parsing modes, includes the step of referring to a domain parameter having a value indicative of a domain from which the text data originated, wherein the domain parameter corresponds to one or more rules of grammar within a knowledge base related to the domain to be applied for parsing the text data. Preferably, the structured output is mapped back to the words in the original sentences of the text data input using XML tags.
申请公布号 AU6526300(A) 申请公布日期 2001.03.05
申请号 AU20000065263 申请日期 2000.08.04
申请人 THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK 发明人 CAROL FRIEDMAN
分类号 G06F17/30;G06F17/27;G06F17/28;G06Q50/00 主分类号 G06F17/30
代理机构 代理人
主权项
地址