发明名称 |
SMART SELECTION OF TEXT SPANS |
摘要 |
A text span forming either a single word or a series of two or more words that a user intended to select is predicted. A document and a location pointer that indicates a particular location in the document are received and input to different candidate text span generation methods. A ranked list of one or more scored candidate text spans is received from each of the different candidate text span generation methods. A machine-learned ensemble model is used to re-score each of the scored candidate text spans that is received from each of the different candidate text span generation methods. The ensemble model is trained using a machine learning method and features from a dataset of true intended user text span selections. A ranked list of re-scored candidate text spans is received from the ensemble model. |
申请公布号 |
US2015100524(A1) |
申请公布日期 |
2015.04.09 |
申请号 |
US201414245646 |
申请日期 |
2014.04.04 |
申请人 |
Microsoft Corporation |
发明人 |
Pantel Patrick;Gamon Michael;Fuxman Ariel Damian;Kohlmeier Bernhard;Chilakamarri Pradeep |
分类号 |
G06N99/00;G06F3/0484;G06N7/00 |
主分类号 |
G06N99/00 |
代理机构 |
|
代理人 |
|
主权项 |
1. A computer-implemented process for predicting a text span forming either a single word or a series of two or more words that a user intended to select, comprising:
using a computer to perform the following process actions: receiving a document comprising a string of characters; receiving a location pointer indicating a particular location in the document; inputting the document and the location pointer to a plurality of different candidate text span generation methods; receiving a ranked list of one or more scored candidate text spans from each of the different candidate text span generation methods; using a machine-learned ensemble model to re-score each of the scored candidate text spans received from each of the different candidate text span generation methods, the ensemble model being trained using a machine learning method and features from a dataset of true intended user text span selections; and receiving a ranked list of re-scored candidate text spans from the ensemble model. |
地址 |
Redmond WA US |