摘要 |
Computer-readable media, computer systems, and computing devices facilitate enhancing a web index with uniform resource locator (URL)/non-encoding character (NEC) word pairs to facilitate relevance ranking of search results provided in response to a search query that includes NEC words. URLs are received from web pages and substrings extracted therefrom. Additional elements are received from the web page, word-broken into sequences of NEC words, and the NEC words are converted into encoding-language representations which are matched against the URL substrings to identify candidate URL/NEC pairs for utilization in relevance ranking. |