I-Vector Based Clustering Training Data in Speech Recognition,申请号US201213640804-传众专利搜索

发明名称	I-Vector Based Clustering Training Data in Speech Recognition
摘要	Methods and systems for i-vector based clustering training data in speech recognition are described. An i-vector may be extracted from a speech segment of a speech training data to represent acoustic information. The extracted i-vectors from the speech training data may be clustered into multiple clusters using a hierarchical divisive clustering algorithm. Using a cluster of the multiple clusters, an acoustic model may be trained. This trained acoustic model may be used in speech recognition.
申请公布号	US2015199960(A1)	申请公布日期	2015.07.16
申请号	US201213640804	申请日期	2012.08.24
申请人	Microsoft Corporation	发明人	Huo Qiang;Yan Zhi-Jie;Zhang Yu;Xu Jian
分类号	G10L15/06	主分类号	G10L15/06
代理机构		代理人
主权项	1. A computer-implemented method for clustering training data in speech recognition, the method comprising: extracting a plurality of i-vectors from speech data including a plurality of speech segments; clustering the plurality of i-vectors into a plurality of clusters; training an acoustic model using one of the plurality of clusters; and recognizing one or more other speech segments using the trained acoustic model.
地址	Redmond WA US