发明名称 Extraction of information from documents
摘要 An information extraction model is trained on format features identified within labeled training documents. Information from a document is extracted by assigning labels to units based on format features of the units within the document. A begin label and end label are identified and the information is extracted between the begin label and the end label. The extracted information can be used in various document processing tasks such as ranking.
申请公布号 US7469251(B2) 申请公布日期 2008.12.23
申请号 US20050192687 申请日期 2005.07.29
申请人 MICROSOFT CORPORATION 发明人 LI HANG;SONG RUIHUA;CAO YUNBO;MEYERZON DMITRIY
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址