Title: Methods and apparatus for performing transformation techniques for data clustering and/or classification
Abstract: Some aspects include transforming data, at least a portion of which has been processed to determine at least one representative vector associated with each of a plurality of classifications associated with the data to obtain a plurality of representative vectors. Techniques comprise determining a first transformation based, at least in part, on the plurality of representative vectors, applying at least the first transformation to the data to obtain transformed data, and fitting a plurality of clusters to the transformed data to obtain a plurality of established clusters. Some aspects include classifying input data by transforming the input data using at least the first transformation and comparing the transformed input data to the established clusters.
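The training-side steps in the abstract (representative vectors, a first transformation, clusters fit to transformed data) can be illustrated with a minimal sketch. The abstract does not say how the transformation is computed or which clustering method is used, so everything specific here is an assumption made for illustration: the representative vector is taken to be the class mean, the transformation is a projection onto an orthonormal basis of the mean-centered representative vectors, and the clusters are fit with plain k-means seeded at the transformed representative vectors. The function names and the toy data are hypothetical.

```python
import math

def mean(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def build_transform(rep_vectors):
    """Assumed transformation: Gram-Schmidt basis of the representative
    vectors after centering them on their common mean."""
    center = mean(rep_vectors)
    basis = []
    for r in rep_vectors:
        v = [x - c for x, c in zip(r, center)]
        for b in basis:
            proj = dot(v, b)
            v = [x - proj * y for x, y in zip(v, b)]
        norm = math.sqrt(dot(v, v))
        if norm > 1e-9:  # skip directions that are already spanned
            basis.append([x / norm for x in v])
    return center, basis

def apply_transform(transform, x):
    """Center x, then project it onto each basis vector."""
    center, basis = transform
    centered = [a - c for a, c in zip(x, center)]
    return [dot(centered, b) for b in basis]

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def fit_clusters(points, seeds, iterations=10):
    """Assumed clustering step: k-means (Lloyd's algorithm) with given seeds."""
    centroids = [list(s) for s in seeds]
    for _ in range(iterations):
        groups = [[] for _ in centroids]
        for p in points:
            k = min(range(len(centroids)), key=lambda i: sq_dist(p, centroids[i]))
            groups[k].append(p)
        # Keep the old centroid if a cluster received no points.
        centroids = [mean(g) if g else c for g, c in zip(groups, centroids)]
    return centroids

# Toy labeled training data (made up for illustration).
training = {
    "yes": [[1.0, 0.0, 0.1], [0.9, 0.1, 0.0], [1.1, -0.1, 0.0]],
    "no": [[-1.0, 0.0, 0.0], [-0.9, 0.1, 0.1], [-1.1, -0.1, 0.0]],
}
labels = list(training)
rep_vectors = [mean(training[c]) for c in labels]  # one representative per class
transform = build_transform(rep_vectors)
transformed = [apply_transform(transform, x) for c in labels for x in training[c]]
seeds = [apply_transform(transform, r) for r in rep_vectors]
clusters = fit_clusters(transformed, seeds)
```

With two classes the centered representative vectors span a single direction, so in this toy case the transformed data is one-dimensional and the two established clusters sit on opposite sides of the origin.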
Publication number: US9117444(B2) Publication date: 2015.08.25
Application number: US201213569792 Filing date: 2012.08.08
Applicant: Nuance Communications, Inc. Inventors: Rachevsky Leonid; Kanevsky Dimitri; Ramabhadran Bhuvana
Classification: G10L15/00; G10L15/14; G10L21/00; G10L15/06; G06N99/00; G06K9/62 Main classification: G10L15/00
Agency: Wolf, Greenfield & Sacks, P.C. Agent: Wolf, Greenfield & Sacks, P.C.
Independent claim: 1. A method of classifying input data representing speech input by a user of a speech application as belonging to one of a plurality of classifications, the plurality of classifications associated with a respective plurality of clusters that were fit to training data representing a plurality of speech utterances associated with the speech application, the method comprising: using at least one processor to perform: obtaining a first transformation generated based, at least in part, on a plurality of representative vectors determined from the training data, the plurality of representative vectors including at least one representative vector determined for each of the plurality of classifications, wherein the plurality of clusters were fit to the training data at least in part by fitting the plurality of clusters to transformed training data obtained by applying the first transformation to the training data; transforming the input data using at least the first transformation to obtain transformed input data representing the speech input by the user of the speech application; comparing the transformed input data to the plurality of clusters to determine which cluster of the plurality of clusters the input data should be associated with; and classifying the input data according to a classification of the plurality of classifications associated with the determined cluster.
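The claimed classification steps (transform the input, compare it to the clusters, classify by the associated cluster) can be sketched as follows. This is only an illustration under stated assumptions: the transformation is taken to be a centering followed by a linear projection, each cluster is represented by a single centroid, and the comparison is a nearest-centroid test under Euclidean distance. The claim itself fixes none of these choices. The names `transform_input` and `classify`, and the values of `center`, `basis`, and `clusters`, are hypothetical stand-ins for a transformation and clusters obtained from training.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def transform_input(x, center, basis):
    """Apply the (assumed linear) first transformation to one input vector."""
    centered = [a - c for a, c in zip(x, center)]
    return [dot(centered, b) for b in basis]

def classify(x, center, basis, clusters):
    """Return the label of the cluster whose centroid is closest to the
    transformed input. `clusters` is a list of (centroid, label) pairs."""
    z = transform_input(x, center, basis)

    def sq_dist(cluster):
        centroid, _label = cluster
        return sum((a - b) ** 2 for a, b in zip(z, centroid))

    _centroid, label = min(clusters, key=sq_dist)
    return label

# Hypothetical transformation and clusters, as if obtained from training.
center = [0.0, 0.0, 0.0]
basis = [[1.0, 0.0, 0.0]]                    # one retained direction
clusters = [([1.0], "yes"), ([-1.0], "no")]  # (centroid, classification)

print(classify([0.8, 0.3, -0.2], center, basis, clusters))  # prints "yes"
```

A mixture-model variant would replace the distance test with a likelihood comparison per cluster; the nearest-centroid rule is used here only because it is the shortest way to show the transform, compare, classify sequence.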
Address: Burlington MA US