Title of Invention |
SYSTEMS AND METHODS FOR IMPROVED SPELL CHECKING |
Abstract |
The present invention leverages iterative transformations of search query strings, along with statistics extracted from search query logs and/or web data, to provide possible alternative spellings for the search query strings. This provides a spell checking means that can be influenced to provide individualized suggestions for each user. By utilizing search query logs, the present invention can account for substrings that are not found in a lexicon but are still acceptable as a search query of interest. This makes it possible to propose higher-quality alternative spellings that go beyond the content of the lexicon. One instance of the present invention operates at a substring level by utilizing word unigram and/or bigram statistics extracted from query logs combined with an iterative search. This provides substantially better spelling alternatives for a given query than employing only substring matching. Other instances can receive input data from sources other than a search query input. |
|
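The iterative, log-driven search described in the abstract can be illustrated with a minimal sketch. It is not the patented method: the toy `QUERY_LOG`, the function names, the plain Levenshtein distance, the per-step distance limit of 1, and the bigram weight of 2 are all assumptions chosen for illustration. Each pass replaces at most one word with a nearby word seen in the log, keeps the change only if the unigram and bigram counts of the whole query improve, and stops when no change helps.

```python
from collections import Counter


def edit_distance(a, b):
    """Plain Levenshtein distance (an assumed stand-in for any string distance)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def build_stats(query_log):
    """Count word unigrams and bigrams over a list of logged queries."""
    unigrams, bigrams = Counter(), Counter()
    for query in query_log:
        words = query.split()
        unigrams.update(words)
        bigrams.update(zip(words, words[1:]))
    return unigrams, bigrams


def score(words, unigrams, bigrams):
    """Hypothetical score: unigram counts plus doubled bigram counts."""
    return (sum(unigrams[w] for w in words)
            + 2 * sum(bigrams[pair] for pair in zip(words, words[1:])))


def correct_query(query, unigrams, bigrams, max_dist=1, max_iters=5):
    """Iteratively apply the single best one-word replacement until none helps."""
    words = query.split()
    for _ in range(max_iters):
        best, best_score = words, score(words, unigrams, bigrams)
        for i, word in enumerate(words):
            for cand in unigrams:
                if cand != word and edit_distance(word, cand) <= max_dist:
                    trial = words[:i] + [cand] + words[i + 1:]
                    trial_score = score(trial, unigrams, bigrams)
                    if trial_score > best_score:
                        best, best_score = trial, trial_score
        if best == words:  # converged: no replacement improves the score
            break
        words = best
    return " ".join(words)


# Toy query log (assumed data, for illustration only).
QUERY_LOG = (["britney spears"] * 5 + ["amazon books"] * 3
             + ["amazon"] * 2 + ["spears"])
UNIGRAMS, BIGRAMS = build_stats(QUERY_LOG)

print(correct_query("britny speers", UNIGRAMS, BIGRAMS))  # britney spears
```

In this sketch the query log plays the role that a fixed lexicon would otherwise play, so a term that appears often in the log is treated as acceptable even if no dictionary lists it. Because the candidates come only from the log, a query whose words are already frequent there, such as "amazon books", is returned unchanged.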
Publication Number |
US2007106937(A1) |
Publication Date |
2007.05.10 |
Application Number |
US20070620171 |
Filing Date |
2007.01.05 |
Applicant |
MICROSOFT CORPORATION |
Inventors |
CUCERZAN SILVIU-PETRU;BRILL ERIC D. |
Classification Numbers |
G06F17/00;G06F17/21;G06F17/27;G06F17/30 |
Main Classification Number |
G06F17/00 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|