Title of invention: METHOD AND DEVICE FOR AUDIO RECOGNITION
Abstract: A method and device for performing audio recognition, including: collecting a first audio document to be recognized; initiating calculation of first characteristic information of the first audio document, including: conducting time-frequency analysis for the first audio document to generate a first preset number of phase channels; and extracting at least one peak value characteristic point from each phase channel of the first preset number of phase channels, where the at least one peak value characteristic point of each phase channel constitutes the peak value characteristic point sequence of said each phase channel; and obtaining a recognition result for the first audio document, wherein the recognition result is identified based on the first characteristic information, and wherein the first characteristic information is calculated based on the respective peak value characteristic point sequences of the preset number of phase channels.
Publication number: US2014219461(A1) Publication date: 2014.08.07
Application number: US201314103753 Filing date: 2013.12.11
Applicant: Tencent Technology (Shenzhen) Company Limited Inventors: LIU Hailong; XIE Dadong; HOU Jie; XIAO Bin; LIU Xiao; CHEN Bo
Classification: G10L19/02; G10L19/018 Main classification: G10L19/02
Agency: Agent:
Independent claim: 1. A method of performing audio recognition, comprising: at a device having one or more processors and memory: collecting a first audio document to be recognized in response to an audio recognition request; initiating calculation of first characteristic information of the first audio document, comprising: conducting time-frequency analysis for the first audio document to generate a first preset number of phase channels for the first audio document; and extracting at least one peak value characteristic point from each phase channel of the first preset number of phase channels, where the at least one peak value characteristic point of each phase channel constitutes the peak value characteristic point sequence of said each phase channel; and obtaining a recognition result for the first audio document, wherein the recognition result includes at least one second audio document having second characteristic information matching the first characteristic information in accordance with one or more preset criteria, and wherein the first characteristic information is calculated based on the respective peak value characteristic point sequences of the preset number of phase channels.
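The feature-extraction steps recited in the claim (time-frequency analysis, a preset number of phase channels, and one peak value characteristic point sequence per channel) can be illustrated with a minimal Python sketch. This is not the patented implementation. The record does not say how phase channels are formed or how a peak is defined, so the sketch assumes that frame t of a short-time spectrum is assigned to channel t mod M, and that a peak is a frequency bin that exceeds both of its neighbours and reaches at least half of the channel's maximum magnitude. All function names, the frame size, the hop size, and the threshold are hypothetical.

```python
import cmath
import math


def spectrogram(samples, frame_size=64, hop=32):
    """Return one magnitude spectrum per frame (Hann window, naive DFT)."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_size)
              for n in range(frame_size)]
    frames = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_size)]
        spectrum = []
        for k in range(frame_size // 2):
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_size)
                      for n in range(frame_size))
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


def split_into_channels(frames, num_channels):
    """Assumed split: frame t goes to phase channel t % num_channels."""
    channels = [[] for _ in range(num_channels)]
    for t, spectrum in enumerate(frames):
        channels[t % num_channels].append((t, spectrum))
    return channels


def peak_sequence(channel, rel_threshold=0.5):
    """Return time-ordered (frame_index, frequency_bin) peaks for one channel."""
    if not channel:
        return []
    # Assumed criterion: local maximum along frequency, above a relative floor.
    floor = rel_threshold * max(max(spectrum) for _, spectrum in channel)
    peaks = []
    for t, spectrum in channel:
        for k in range(1, len(spectrum) - 1):
            v = spectrum[k]
            if v >= floor and v > spectrum[k - 1] and v > spectrum[k + 1]:
                peaks.append((t, k))
    return peaks


def characteristic_point_sequences(samples, num_channels=3):
    """Compute one peak value characteristic point sequence per phase channel."""
    frames = spectrogram(samples)
    return [peak_sequence(ch) for ch in split_into_channels(frames, num_channels)]


if __name__ == "__main__":
    # A pure tone centred on frequency bin 8 of a 64-sample frame.
    tone = [math.sin(2 * math.pi * 8 * n / 64) for n in range(320)]
    for m, seq in enumerate(characteristic_point_sequences(tone)):
        print("channel", m, seq)
```

In a complete system, the resulting sequences would be turned into the first characteristic information and compared against stored second characteristic information; the matching criteria are not specified in this record and are left out of the sketch.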
Address: Shenzhen CN