发明名称 System and method for using anchor text as training data for classifier-based search systems
摘要 A computer implemented information retrieval system is provided. The system includes a user input configured to receive a user query relative to the corpus. A machine learning classifier is trained with a first set of training data comprising anchor text relative to at least some of the documents in the corpus. A processing unit is adapted to interact with the classifier to obtain search results relative to the query using the machine learning classifier. In some aspects, the classifier is also trained with a second set of training data. A method of integrating a new document into a corpus of documents is also provided. A method of training a machine learning classifier for retrieving documents from a corpus using two distinct types of training data is also provided.
申请公布号 US7480667(B2) 申请公布日期 2009.01.20
申请号 US20040023856 申请日期 2004.12.24
申请人 MICROSOFT CORPORATION 发明人 HARR CHEN;RATNAPARKHI ADWAIT;KNOLL SONJA S.;HON HSIAO-WUEN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址