发明名称 PREDICTING RESULTS FOR INPUT DATA BASED ON A MODEL GENERATED FROM CLUSTERS
摘要 <p>A method for predicting results for input data based on a model that is generated based on clusters of related characters, clusters of related segments, and training data. The method comprises receiving a data set that includes a plurality of words in a particular language. In the particular language, words are formed by characters. Clusters of related characters are formed from the data set. A model is generated based at least on the clusters of related characters and training data. The model may also be based on the clusters of related segments. The training data includes a plurality of entries, wherein each entry includes a character and a designated result for said character. A set of input data that includes characters that have not been associated with designated results is received. The model is applied to the input data to determine predicted results for characters within the input data.</p>
申请公布号 WO2007142982(A1) 申请公布日期 2007.12.13
申请号 WO2007US12762 申请日期 2007.05.30
申请人 PENG, FUCHUN;YAHOO! INC. 发明人 PENG, FUCHUN
分类号 G06F15/18;G06F17/00 主分类号 G06F15/18
代理机构 代理人
主权项
地址