SPEAKER RECOGNITION USING LOCAL MODELS,申请号WO2004US09539-传众专利搜索

发明名称	SPEAKER RECOGNITION USING LOCAL MODELS
摘要	A system and method for voice recognition is disclosed. The system enrolls speakers using an enrollment voice samples and identification information. An extraction module characterizes enrollment voice samples with high-dimensional feature vectors or speaker data points. A data structuring module organizes data points into a high-dimensional data structure, such as a kd-tree, in which similarity between data points dictates a distance, such as a Euclidean distance, a Minkowski distance, or a Manhattan distance. The system recognizes a speaker using an unidentified voice sample. A data querying module searches the data structure to generate a subset of approximate nearest neighbors based on an extracted high-dimensional feature vector. A data modeling module uses Parzen windows to estimate a probability density function representing how closely characteristics of the unidentified speaker match enrolled speakers, in real-time, without extensive training data or parametric assumptions about data distribution. A smoothing parameter controls the relative contributions of close and far speaker data points to the estimated density.
申请公布号	WO2004088632(A2)	申请公布日期	2004.10.14
申请号	WO2004US09539	申请日期	2004.03.26
申请人	HONDA MOTOR CO., LTD.;RIFKIN, RYAN	发明人	RIFKIN, RYAN
分类号	G10L17/00	主分类号	G10L17/00
代理机构		代理人
主权项
地址