发明名称 Leveraging cross-document context to label entity
摘要 Entities, such as people, places and things, are labeled based on information collected across a possibly large number of documents. One or more documents are scanned to recognize the entities, and features are extracted from the context in which those entities occur in the documents. Observed entity-feature pairs are stored either in an in-memory store or an external store. A store manager optimizes use of the limited amount of space for an in-memory store by determining which store to put an entity-feature pair in, and when to evict features from the in-memory store to make room for new pairs. Feature that may be observed in an entity's context may take forms such as specific word sequences or membership in a particular list.
申请公布号 US7970808(B2) 申请公布日期 2011.06.28
申请号 US20080114824 申请日期 2008.05.05
申请人 MICROSOFT CORPORATION 发明人 KONIG ARND CHRISTIAN;GANTI VENKATESH
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址