Scalable lookup-driven entity extraction from indexed document collections,申请号US200812144675-传众专利搜索

首页产品黄页商标征信

会员服务注册登录

法人/股东/高管

发明名称	Scalable lookup-driven entity extraction from indexed document collections
摘要	A set of documents is filtered for entity extraction. A list of entity strings is received. A set of token sets that covers the entity strings in the list is determined. An inverted index generated on a first set of documents is queried using the set of token sets to determine a set of document identifiers for a subset of the documents in the first set. A second set of documents identified by the set of document identifiers is retrieved from the first set of documents. The second set of documents is filtered to include one or more documents of the second set that each includes a match with at least one entity string of the list of entity strings. Entity recognition may be performed on the filtered second set of documents.
申请公布号	US8782061(B2)	申请公布日期	2014.07.15
申请号	US200812144675	申请日期	2008.06.24
申请人	Microsoft Corporation	发明人	Agrawal Sanjay;Chakrabarti Kaushik;Chaudhuri Surajit;Ganti Venkatesh
分类号	G06F17/30;G06F7/00	主分类号	G06F17/30
代理机构		代理人	Choi Dan;Taylor Peter;Minhas Micky
主权项	1. A method for filtering a set of documents, comprising: receiving a list of entity strings; determining a set of token sets that covers the entity strings in the list, the number of tokens in the set of token sets being less than the number of words of the entity strings in the list of entity strings; querying an inverted index generated on a first set of documents using the set of token sets to determine a set of document identifiers for a subset of the documents in the first set; retrieving from the first set of documents a second set of documents, which is a subset of the first set of documents, identified by the set of document identifiers; and filtering the second set of documents to include one or more documents of the second set that each include a match with at least one entity string of the list of entity strings.
地址	Redmond WA US

您可能感兴趣的专利

一种单管气囊式瓦斯钻孔封孔装置

一种自动感应冲水马桶

一种带有裹包成型装置的自动包装机

一种低抗原仔猪配合饲料及其制备方法

预应力混凝土变截面鱼腹式连续箱梁施工方法

一种指纹防反锁系统

一种植物愈伤组织遗传转化和植物组织培养方法

高强度缓冲式泥石两用密封自卸车厢

一种在线取样翻杆的装置

精制麻辣酸肉酱

一种船用双人床

维帕他韦中间体的合成方法

双L型管幕结构控制框构桥顶进方法

一种可照明拖鞋

一种透水砼路面的施工方法

吸音墙纸的制作方法

带调味、保温混合机构的多味红薯干制作成套装备

无缝硬质壁纸及其制备方法

造纸助剂及其制备方法

一种帮病人起床的设备