发明名称 Using an ID domain to improve searching
摘要 Methods which use an ID domain to improve searching are described. An embodiment describes an index phase in which an image of a document is converted into the ID domain. This is achieved by dividing the text in the image into elements and mapping each element to an identifier. Similar elements are mapped to the same identifier. Each element in the text is then replaced by the appropriate identifier to create a version of the document in the ID domain. This version may be indexed and searched. Another embodiment describes a query phase in which a query is converted into the ID domain and then used to search an index of identifiers which has been created from collections of documents which have been converted into the ID domain. The conversion of the query may use mappings which were created during the index phase or alternatively may use pre-existing mappings.
申请公布号 US8538964(B2) 申请公布日期 2013.09.17
申请号 US201113314606 申请日期 2011.12.08
申请人 MAGDY WALID;EL-SABAN MOTAZ AHMED;MICROSOFT CORPORATION 发明人 MAGDY WALID;EL-SABAN MOTAZ AHMED
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 代理人
主权项
地址