发明名称 Method and Apparatus for Discovering and Classifying Polysemous Word Instances in Web Documents
摘要 A method and apparatus for discovering polysemous words and classifying polysemous words found in web documents. All document corpi in any natural language have words that have multiple usage contexts or words that have multiple meanings. Semantic analysis is not feasible for classifying all word occurrences in all documents on the web, which contain trillions of words in total. In addition, semantic analysis typically cannot distinguish multiple usages of a given meaning of a given word. In one embodiment of this invention, polysemous words in natural languages can be discovered by analyzing the co-occurrence of other words with the polysemous word in web documents. In one embodiment, the multiple meanings and usages of a polysemous word can be determined by analyzing the co-occurrences of other words with the polysemous word. In one embodiment, counting overcorrelations is achieved probabilistically to minimize use of network bandwidth.
申请公布号 US2009157648(A1) 申请公布日期 2009.06.18
申请号 US20070957272 申请日期 2007.12.14
申请人 KING RICHARD MICHAEL 发明人 KING RICHARD MICHAEL
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址