发明名称 Method and Apparatus for Using Convolutional Neural Networks in Speech Recognition
摘要 Speech recognition techniques are employed in a variety of applications and services serving large numbers of users. As such, there is an increasing demand for speech recognition systems with enhanced performance. Specifically, enhanced performance in large vocabulary continuous speech recognition (LVCSR) systems is a market demand. Herein, convolutional neural networks are explored as an alternative speech recognition approach and different CNN architectures are tested. According to at least one example embodiment, a method and corresponding apparatus for performing speech recognition comprise employing a CNN with at least two convolutional layers and at least two fully-connected layers in speech recognition. Using the CNN a textual representation of input audio data may be provided based on output data by the CNN.
申请公布号 US2015032449(A1) 申请公布日期 2015.01.29
申请号 US201313952455 申请日期 2013.07.26
申请人 Nuance Communications, Inc. 发明人 Sainath Tara N.;Mohamed Abdel-Rahman S.;Kingsbury Brian E. D.;Ramabhadran Bhuvana
分类号 G10L15/26 主分类号 G10L15/26
代理机构 代理人
主权项 1. A method of performing speech recognition, the method comprising: processing, by a cascade of at least two convolutional layers of a convolutional neural network, feature parameters extracted from audio data; processing, by a cascade of at least two fully connected layers of the convolutional neural network, output of the cascade of the at least two consecutive convolutional layers; and providing a textual representation of the input audio data based on the output of a last layer of the at least two consecutive fully connected layers of the convolutional neural network.
地址 Burlington MA US