Title of invention: Speech recognition system including an image capturing device and oral cavity tongue detecting device, speech recognition device, and method for speech recognition
Abstract: A speech recognition system is to be used on a human subject. The speech recognition system includes an image capturing device, an oral cavity detecting device and a speech recognition device. The image capturing device captures images of lips of the subject during a speech of the subject. The oral cavity detecting device detects contact with a tongue of the subject and distance from the tongue of the subject, and accordingly generates a contact signal and a distance signal. The speech recognition device processes the images of the lips and the contact and distance signals so as to obtain content of the speech of the subject.
Publication number: US9424842(B2) Publication date: 2016.08.23
Application number: US201514809739 Filing date: 2015.07.27
Applicant: Liu Ching-Feng; Chen Hsiao-Han Inventor: Liu Ching-Feng; Chen Hsiao-Han
Classification: G10L15/25; G10L15/08; G10L15/24; G10L21/10 Main classification: G10L15/25
Agency: Muncy, Geissler, Olds & Lowe, P.C. Agent: Muncy, Geissler, Olds & Lowe, P.C.
Principal claim: 1. A speech recognition system to be used on a human subject, said speech recognition system comprising: an image capturing device for successively capturing images of lips of the subject during a speech of the subject; an oral cavity detecting device including a carrier base configured to be mounted in an oral cavity of the subject at a palate of the subject, a contact detecting unit disposed on said carrier base, and configured to detect contact with a tongue of the subject and to generate a contact signal according to the contact with the tongue during the speech of the subject, and a distance detecting unit disposed on said carrier base, and configured to detect a distance from the tongue of the subject and to generate a distance signal according to the distance from the tongue; and a speech recognition device coupled to said image capturing device and said oral cavity detecting device for respectively receiving the images of the lips of the subject and the contact and distance signals, and programmed to process the images of the lips and the contact and distance signals so as to obtain content of the speech of the subject.
Address: Kaohsiung TW
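As an illustration of the three inputs the claimed speech recognition device receives (lip images, a contact signal, and a distance signal), the following is a minimal sketch of how such inputs might be represented and paired in time before recognition. It is not the patented method: the record does not describe how the signals are processed, so the type names (`LipFrame`, `TongueSample`, `FusedObservation`), the timestamp field `t`, and the nearest-in-time pairing in `fuse` are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class LipFrame:
    """One image from the image capturing device (hypothetical layout)."""
    t: float        # capture time in seconds (assumed field)
    pixels: bytes   # raw image data, treated as opaque here


@dataclass
class TongueSample:
    """One reading from the oral cavity detecting device (hypothetical layout)."""
    t: float         # sample time in seconds (assumed field)
    contact: bool    # contact signal: is the tongue touching the sensor?
    distance: float  # distance signal; units are not given in the record


@dataclass
class FusedObservation:
    """A lip frame paired with the tongue sample closest to it in time."""
    lip: LipFrame
    tongue: TongueSample


def fuse(frames: Sequence[LipFrame],
         samples: Sequence[TongueSample]) -> List[FusedObservation]:
    """Pair each lip frame with the tongue sample nearest in time.

    Nearest-in-time pairing is an assumption of this sketch, chosen only
    to show the three inputs arriving at one place.
    """
    if not samples:
        return []
    return [
        FusedObservation(f, min(samples, key=lambda s: abs(s.t - f.t)))
        for f in frames
    ]
```

A downstream recognizer would then consume the list of `FusedObservation` values; how it maps them to speech content is outside what this record states.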