发明名称 METHOD AND SYSTEM FOR AUTOMATIC SPEECH RECOGNITION
摘要 A method of recognizing speech is provided that includes generating a decoding network that includes a primary sub-network and a classification sub-network. The primary sub-network includes a classification node corresponding to the classification sub-network. The classification sub-network corresponds to a group of uncommon words. A speech input is received and decoded by instantiating a token in the primary sub-network and passing the token through the primary network. When the token reaches the classification node, the method includes transferring the token to the classification sub-network and passing the token through the classification sub-network. When the token reaches an accept node of the classification sub-network, the method includes returning a result of the token passing through the classification sub-network to the primary sub-network. The result includes one or more words in the group of uncommon words. A string corresponding to the speech input is output that includes the one or more words.
申请公布号 US2014236591(A1) 申请公布日期 2014.08.21
申请号 US201414263958 申请日期 2014.04.28
申请人 Tencent Technology (Shenzhen) Company Limited 发明人 YUE Shuai;Lu Li;Zhang Xiang;Xie Dadong;Chen Bo;Rao Feng
分类号 G10L15/22 主分类号 G10L15/22
代理机构 代理人
主权项 1. A method of recognizing speech, comprising: generating a decoding network for decoding speech input, the decoding network comprising a primary sub-network and one or more classification sub-networks, wherein: the primary sub-network includes a plurality of classification nodes, each classification node corresponding to a respective classification sub-network of the one or more classification sub-networks; andeach classification sub-network of the one or more classification sub-networks corresponds to a group of uncommon words; receiving a speech input; and decoding the speech input by: instantiating a token corresponding to the speech input in the primary sub-network;passing the token through the primary network;when the token reaches a respective classification node of the plurality of classification nodes, transferring the token to the corresponding classification sub-network;passing the token through the corresponding classification sub-network;when the token reaches an accept node of the classification sub-network, returning a result of the token passing through the classification sub-network to the primary sub-network, wherein the result includes one or more words in the group of uncommon words corresponding to the classification sub-network outputting a string corresponding to the speech input that includes the one or more words.
地址 Shenzhen CN