Title of invention: METHOD AND SYSTEM FOR ADDING PUNCTUATION TO VOICE FILES
Abstract: A method and system for adding punctuation to a voice file are disclosed. The method includes: using silence or pause duration detection to divide a voice file into a plurality of speech segments for processing, the voice file comprising a plurality of feature units; identifying all feature units that appear in the voice file according to each term or expression that forms each of the plurality of speech segments and the semantic features of each such term or expression; using a linguistic model to determine a sum of weights for each of various punctuation modes in the voice file according to all the feature units, the linguistic model being built from the semantic features of terms or expressions parsed out of the body text of spoken sentences according to a language library; and adding punctuation to the voice file based on the determined sums of weights of the various punctuation modes.
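The first step of the abstract, dividing a voice file at silences or pauses, can be illustrated with a minimal sketch. It assumes the audio has already been reduced to one energy value per frame; the function name `split_on_silence`, the energy threshold, and the minimum pause length are hypothetical choices for illustration and are not taken from the patent.

```python
from typing import List, Tuple


def split_on_silence(
    energies: List[float],
    threshold: float,
    min_silence_frames: int,
) -> List[Tuple[int, int]]:
    """Return (start, end) frame ranges (end exclusive) of speech segments.

    A frame counts as silent when its energy is below `threshold` (assumed
    criterion). A segment is closed only once at least `min_silence_frames`
    consecutive silent frames have been seen, so short pauses stay inside
    the segment.
    """
    segments: List[Tuple[int, int]] = []
    start = None      # first frame of the segment currently being built
    silent_run = 0    # consecutive silent frames since the last speech frame

    for i, energy in enumerate(energies):
        if energy >= threshold:
            if start is None:
                start = i
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                # Close the segment at the first frame of the silent run.
                segments.append((start, i - silent_run + 1))
                start = None
                silent_run = 0

    if start is not None:
        # The file ended inside a segment; drop any trailing silent frames.
        segments.append((start, len(energies) - silent_run))
    return segments


frames = [0.9, 0.8, 0.0, 0.0, 0.0, 0.7, 0.6, 0.0]
print(split_on_silence(frames, threshold=0.5, min_silence_frames=2))
```

In this sketch a pause shorter than `min_silence_frames` does not split a segment, which reflects one plausible reading of "pause duration detection"; the record itself does not state how the duration is measured.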
Publication number: US2014350918(A1) Publication date: 2014.11.27
Application number: US201414219704 Filing date: 2014.03.19
Applicant: TENCENT TECHNOLOGY (SHENZHEN) CO., LTD. Inventors: LIU Haibo;WANG Eryu;ZHANG Xiang;LU Li;YUE Shuai;CHEN Bo;LI Lou;LIU Jian
Classification: G06F17/24;G06F17/27 Main classification: G06F17/24
Agency: Agent:
Principal claim: 1. A method for adding punctuations to a voice file, comprising: utilizing silence or pause duration detection to divide a voice file into a plurality of speech segments for processing, the voice file comprising a plurality of features units; identifying all features units that appear in the voice file according to every term or expression and semantics features of the every term or expression that form each of the plurality of speech segments; using a linguistic model to determine a sum of weight of various punctuation modes in the voice file according to all the feature units, wherein the linguistic model is built upon semantics features of various parsed out terms or expressions from a body text of a spoken sentence according to a language library; and adding punctuations to the voice file based on the determined sum of weight of the various punctuation modes.
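The weight-summing step of the claim can be illustrated with a short sketch. It assumes the linguistic model can be represented as a lookup table from each feature unit to a weight for each punctuation mode; the table layout, the feature-unit strings, the mode names, and the functions `sum_mode_weights` and `choose_mode` are all hypothetical and do not come from the patent, which does not specify the model's internal form in this record.

```python
from collections import defaultdict
from typing import Dict, List

# Assumed model shape: feature unit -> {punctuation mode -> weight}.
Model = Dict[str, Dict[str, float]]


def sum_mode_weights(feature_units: List[str], model: Model) -> Dict[str, float]:
    """Add up, per punctuation mode, the weights of all observed feature units."""
    totals: Dict[str, float] = defaultdict(float)
    for unit in feature_units:
        # Feature units unknown to the model contribute nothing.
        for mode, weight in model.get(unit, {}).items():
            totals[mode] += weight
    return dict(totals)


def choose_mode(totals: Dict[str, float], default: str = "none") -> str:
    """Pick the punctuation mode with the largest summed weight."""
    if not totals:
        return default
    return max(totals, key=lambda mode: totals[mode])


# Hypothetical weights, for illustration only.
model: Model = {
    "term=what": {"question_mark": 1.5, "period": 0.25},
    "pos=verb_final": {"question_mark": 0.5, "period": 0.75},
}
units = ["term=what", "pos=verb_final", "term=unseen"]

totals = sum_mode_weights(units, model)
print(totals, choose_mode(totals))
```

Selecting the mode with the highest total is only one way to act on the sums; the claim states that punctuation is added "based on" them without fixing a decision rule.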
Address: Shenzhen CN