Title: Applying a structured language model to information extraction
Abstract: One feature of the present invention uses the parsing capabilities of a structured language model in the information extraction process. During training, the structured language model is first initialized with syntactically annotated training data. The model is then trained by generating parses on semantically annotated training data while enforcing the annotated constituent boundaries. The syntactic labels in the parse trees generated by the parser are then replaced with joint syntactic and semantic labels. The model is then trained again by generating parses on the semantically annotated training data, this time enforcing the semantic tags or labels found in the training data. The trained model can then be used to extract information from test data using the parses it generates.
Publication number: US7805302(B2)  Publication date: 2010.09.28
Application number: US20020151979  Filing date: 2002.05.20
Applicant: MICROSOFT CORPORATION  Inventors: CHELBA CIPRIAN; MAHAJAN MILIND
Classification: G10L15/00; G06F17/27; G10L15/18  Main classification: G10L15/00
Agency:  Agent:
Main claim:
Address:
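
Illustrative sketch (not part of the patent record): the abstract describes replacing the syntactic labels in generated parse trees with joint syntactic and semantic labels, and later reading extracted information off the parses. The Python below is one minimal way to picture that step under stated assumptions. The `Node` class, the `relabel` and `extract` functions, the `|` separator, the `NULL` tag, and the example sentence and tags are all hypothetical names chosen for illustration; the patent's actual label scheme and parser are not shown in this record.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Span = Tuple[int, int]  # (start, end) word indices, end exclusive

JOINT_SEP = "|"  # assumed separator; assumes syntactic labels never contain it


@dataclass
class Node:
    """A parse-tree constituent with a label and the word span it covers."""
    label: str
    span: Span
    children: List["Node"] = field(default_factory=list)


def relabel(node: Node, semantic: Dict[Span, str], null_tag: str = "NULL") -> Node:
    """Return a copy of the tree whose labels are joint syntactic|semantic labels.

    A constituent whose span matches a semantically annotated span receives
    that semantic tag; every other constituent receives `null_tag`.
    """
    tag = semantic.get(node.span, null_tag)
    return Node(
        label=f"{node.label}{JOINT_SEP}{tag}",
        span=node.span,
        children=[relabel(child, semantic, null_tag) for child in node.children],
    )


def extract(node: Node, null_tag: str = "NULL") -> List[Tuple[str, Span]]:
    """Collect (semantic tag, span) pairs from a jointly labeled tree, in pre-order."""
    _, _, tag = node.label.partition(JOINT_SEP)
    found = [(tag, node.span)] if tag and tag != null_tag else []
    for child in node.children:
        found.extend(extract(child, null_tag))
    return found


# Hypothetical example: "schedule meeting with John" (word indices 0..3).
tree = Node("S", (0, 4), [
    Node("VP", (0, 2)),
    Node("PP", (2, 4), [Node("NP", (3, 4))]),
])
# Hypothetical semantic annotation: a frame over the sentence and one slot.
semantic = {(0, 4): "ScheduleMeeting", (3, 4): "Attendee"}

joint = relabel(tree, semantic)
print(joint.label)     # root now carries a joint label
print(extract(joint))  # semantic tags and spans read off the parse
```

The sketch copies the tree rather than mutating it, so the purely syntactic parse remains available for comparison. It covers only the relabeling and read-off steps; the constrained parsing and model re-estimation described in the abstract are outside its scope.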