发明名称 GENERATING HYPERLINKS AND ANCHOR TEXT IN HTML AND NON-HTML DOCUMENTS
摘要 Systems and methods for generation of hyperlinks and anchor text from data such as reference text in HTML and in non-HTML documents are disclosed. The method generally includes locating a text reference in a source document, searching using a search engine for a target document relating to the text reference, computing anchor text from the text reference, generating a hyperlink to the target document, and associating the hyperlink with the computed anchor text. The locating and/or computing may be based on a respective statistical model of text formatting and/or lexical cues. The tex t reference may be parsed into pieces such that the searching, computing, generating, and associating are performed for each piece of text. The source document may be an HTML or non-HTML document. The text reference may be a reference to, for example, a paper, article, company, institution, product, search engine, image, object, and geographical location.
申请公布号 CA2551840(A1) 申请公布日期 2005.07.21
申请号 CA20042551840 申请日期 2004.12.30
申请人 GOOGLE, INC. 发明人 MITTAL, VIBHU
分类号 G06F17/27;G06F17/30 主分类号 G06F17/27
代理机构 代理人
主权项
地址