发明名称 Part-of-speech tagging using latent analogy
摘要 Methods and apparatuses to assign part-of-speech tags to words are described. An input sequence of words is received. A global fabric of a corpus having training sequences of words may be analyzed in a vector space. A global semantic information associated with the input sequence of words may be extracted based on the analyzing. A part-of-speech tag may be assigned to a word of the input sequence based on POS tags from pertinent words in relevant training sequences identified using the global semantic information. The input sequence may be mapped into a vector space. A neighborhood associated with the input sequence may be formed in the vector space wherein the neighborhood represents one or more training sequences that are globally relevant to the input sequence.
申请公布号 US9053089(B2) 申请公布日期 2015.06.09
申请号 US200711906592 申请日期 2007.10.02
申请人 Apple Inc. 发明人 Bellegarda Jerome
分类号 G06F17/20;G06F17/28;G06F17/27;G10L15/18;G10L13/08 主分类号 G06F17/20
代理机构 Morrison & Foerster LLP 代理人 Morrison & Foerster LLP
主权项 1. A method, comprising: analyzing a corpus having first training sequences of words in a semantic vector space; extracting a global semantic information associated with an input sequence of words from the semantic vector space; selecting second training sequences of words having part-of-speech tags in the semantic vector space based on the global semantic information and the first training sequences; and assigning a part-of-speech tag to at least one word of the input sequence based on the part-of-speech tags of the second training sequences, wherein at least one of the analyzing, extracting, selecting, and assigning is performed by a processor.
地址 Cupertino CA US