发明名称 SMART SELECTION OF TEXT SPANS
摘要 A text span forming either a single word or a series of two or more words that a user intended to select is predicted. A document and a location pointer that indicates a particular location in the document are received and input to different candidate text span generation methods. A ranked list of one or more scored candidate text spans is received from each of the different candidate text span generation methods. A machine-learned ensemble model is used to re-score each of the scored candidate text spans that is received from each of the different candidate text span generation methods. The ensemble model is trained using a machine learning method and features from a dataset of true intended user text span selections. A ranked list of re-scored candidate text spans is received from the ensemble model.
申请公布号 US2015100524(A1) 申请公布日期 2015.04.09
申请号 US201414245646 申请日期 2014.04.04
申请人 Microsoft Corporation 发明人 Pantel Patrick;Gamon Michael;Fuxman Ariel Damian;Kohlmeier Bernhard;Chilakamarri Pradeep
分类号 G06N99/00;G06F3/0484;G06N7/00 主分类号 G06N99/00
代理机构 代理人
主权项 1. A computer-implemented process for predicting a text span forming either a single word or a series of two or more words that a user intended to select, comprising: using a computer to perform the following process actions: receiving a document comprising a string of characters; receiving a location pointer indicating a particular location in the document; inputting the document and the location pointer to a plurality of different candidate text span generation methods; receiving a ranked list of one or more scored candidate text spans from each of the different candidate text span generation methods; using a machine-learned ensemble model to re-score each of the scored candidate text spans received from each of the different candidate text span generation methods, the ensemble model being trained using a machine learning method and features from a dataset of true intended user text span selections; and receiving a ranked list of re-scored candidate text spans from the ensemble model.
地址 Redmond WA US