Title of invention: Smart selection of text spans
Abstract: A text span forming either a single word or a series of two or more words that a user intended to select is predicted. A document and a location pointer that indicates a particular location in the document are received and input to different candidate text span generation methods. A ranked list of one or more scored candidate text spans is received from each of the different candidate text span generation methods. A machine-learned ensemble model is used to re-score each of the scored candidate text spans that is received from each of the different candidate text span generation methods. The ensemble model is trained using a machine learning method and features from a dataset of true intended user text span selections. A ranked list of re-scored candidate text spans is received from the ensemble model.
Publication number: US9436918(B2)
Publication date: 2016.09.06
Application number: US201414245646
Filing date: 2014.04.04
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
Inventors: Pantel Patrick; Gamon Michael; Fuxman Ariel Damian; Kohlmeier Bernhard; Chilakamarri Pradeep
Classification: G06N3/04; G06N99/00; G06N7/00; G06F3/0484; G06F17/30
Main classification: G06N3/04
Agency:
Agents: Corie Alin; Swain Sandy; Minhas Micky
Main claim: 1. A computer-implemented process for predicting a text span forming either a single word or a series of two or more words that a user intended to select, comprising: using a computer to perform the following process actions: receiving a document comprising a string of characters; receiving a location pointer indicating a particular location in the document; inputting the document and the location pointer to a plurality of different candidate text span generation methods; receiving a ranked list of one or more scored candidate text spans from each of the different candidate text span generation methods; using a machine-learned ensemble model to re-score each of the scored candidate text spans received from each of the different candidate text span generation methods, the ensemble model being trained using a machine learning method and features from a dataset of true intended user text span selections; and receiving a ranked list of re-scored candidate text spans from the ensemble model.
Address: Redmond, WA, US
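
The process recited in the main claim (receive a document and a location pointer, collect scored candidates from several generation methods, then re-score and rank them with an ensemble model) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `Candidate` type, the two toy generators `word_at_pointer` and `capitalized_run`, the three features, and the `WEIGHTS` table. In particular, the weights are hand-set stand-ins; the claim calls for a model trained by a machine learning method on a dataset of true intended user selections, which this sketch does not do.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Candidate:
    start: int    # inclusive character offset into the document
    end: int      # exclusive character offset
    score: float  # score from the generating method, or from the ensemble after re-scoring
    source: str   # name of the method that produced the span


Generator = Callable[[str, int], List[Candidate]]


def word_at_pointer(doc: str, pos: int) -> List[Candidate]:
    """Hypothetical generator: the single word under the pointer."""
    for m in re.finditer(r"\w+", doc):
        if m.start() <= pos < m.end():
            return [Candidate(m.start(), m.end(), 1.0, "word")]
    return []


def capitalized_run(doc: str, pos: int) -> List[Candidate]:
    """Hypothetical generator: a run of two or more capitalized words covering the pointer."""
    return [
        Candidate(m.start(), m.end(), 0.8, "caps")
        for m in re.finditer(r"[A-Z]\w*(?: [A-Z]\w*)+", doc)
        if m.start() <= pos < m.end()
    ]


# Assumption: hand-set weights standing in for a learned ensemble model.
WEIGHTS: Dict[str, float] = {"base_score": 0.5, "is_caps": 1.0, "num_words": 0.25}


def features(doc: str, c: Candidate) -> Dict[str, float]:
    """Illustrative features only; the patent's actual feature set is not shown here."""
    text = doc[c.start:c.end]
    return {
        "base_score": c.score,
        "is_caps": 1.0 if c.source == "caps" else 0.0,
        "num_words": float(len(text.split())),
    }


def rescore(doc: str, candidates: List[Candidate]) -> List[Candidate]:
    """Re-score every candidate with the linear stand-in model and rank best first."""
    rescored = [
        Candidate(c.start, c.end,
                  sum(WEIGHTS[k] * v for k, v in features(doc, c).items()),
                  c.source)
        for c in candidates
    ]
    return sorted(rescored, key=lambda c: c.score, reverse=True)


def predict_spans(doc: str, pos: int, generators: List[Generator]) -> List[Candidate]:
    """Pool the candidates from every generator, then re-score and rank them."""
    pooled: List[Candidate] = []
    for generate in generators:
        pooled.extend(generate(doc, pos))
    return rescore(doc, pooled)


doc = "I visited New York City last week."
ranked = predict_spans(doc, doc.index("York"), [word_at_pointer, capitalized_run])
for c in ranked:
    print(f"{c.score:.2f}  {doc[c.start:c.end]!r}  ({c.source})")
```

With the pointer placed on "York", both toy generators return one candidate each, and the stand-in model ranks the multi-word span above the single word. Keeping the generators behind a common `Generator` signature is one way to let new candidate methods be added without touching the re-scoring step.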