Title of invention: METHODS AND DEVICES FOR SOURCE CONTROLLED VARIABLE BIT-RATE WIDEBAND SPEECH CODING
Abstract: SPEECH SIGNAL CLASSIFICATION AND ENCODING SYSTEMS AND METHODS (100) ARE DISCLOSED HEREIN. THE SIGNAL CLASSIFICATION IS DONE IN THREE STEPS, EACH DISCRIMINATING A SPECIFIC SIGNAL CLASS. FIRST, A VOICE ACTIVITY DETECTOR (VAD) DISCRIMINATES BETWEEN ACTIVE AND INACTIVE SPEECH FRAMES (102). IF AN INACTIVE SPEECH FRAME IS DETECTED (BACKGROUND NOISE SIGNAL), THE CLASSIFICATION CHAIN ENDS AND THE FRAME IS ENCODED WITH COMFORT NOISE GENERATION (CNG) (402). IF AN ACTIVE SPEECH FRAME IS DETECTED, THE FRAME IS SUBJECTED TO A SECOND CLASSIFIER DEDICATED TO DISCRIMINATING UNVOICED FRAMES (404). IF THE CLASSIFIER CLASSIFIES THE FRAME AS AN UNVOICED SPEECH SIGNAL, THE CLASSIFICATION CHAIN ENDS, AND THE FRAME IS ENCODED USING A CODING METHOD OPTIMIZED FOR UNVOICED SIGNALS. OTHERWISE, THE SPEECH FRAME IS PASSED THROUGH TO THE "STABLE VOICED" CLASSIFICATION MODULE (110). IF THE FRAME IS CLASSIFIED AS A STABLE VOICED FRAME, THE FRAME IS ENCODED USING A CODING METHOD OPTIMIZED FOR STABLE VOICED SIGNALS. OTHERWISE, THE FRAME IS LIKELY TO CONTAIN A NON-STATIONARY SPEECH SEGMENT SUCH AS A VOICED ONSET OR A RAPIDLY EVOLVING VOICED SPEECH SIGNAL. IN THIS CASE, A GENERAL-PURPOSE SPEECH CODER IS USED AT A HIGH BIT RATE TO SUSTAIN GOOD SUBJECTIVE QUALITY. (FIG. 1)
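The three-step classification chain described in the abstract can be sketched as a short decision cascade. This is a minimal illustration only, not the patented implementation: the names `CodingMode`, `select_coding_mode`, `is_active`, `is_unvoiced`, and `is_stable_voiced` are hypothetical, and the three classifiers are passed in as callables because the abstract does not specify how each decision is computed.

```python
from enum import Enum
from typing import Callable, Sequence

# A frame is modeled here as a sequence of samples (an assumption for illustration).
Frame = Sequence[float]
Classifier = Callable[[Frame], bool]


class CodingMode(Enum):
    """Hypothetical labels for the four outcomes named in the abstract."""
    CNG = "comfort noise generation"
    UNVOICED = "coding optimized for unvoiced signals"
    STABLE_VOICED = "coding optimized for stable voiced signals"
    GENERIC = "general-purpose coding at a high bit rate"


def select_coding_mode(
    frame: Frame,
    is_active: Classifier,         # stand-in for the voice activity detector (VAD)
    is_unvoiced: Classifier,       # stand-in for the unvoiced classifier
    is_stable_voiced: Classifier,  # stand-in for the "stable voiced" classifier
) -> CodingMode:
    """Run the chain in order; the first classifier that claims the frame ends it."""
    # Step 1: inactive frames (background noise) end the chain with CNG.
    if not is_active(frame):
        return CodingMode.CNG
    # Step 2: active frames classified as unvoiced end the chain here.
    if is_unvoiced(frame):
        return CodingMode.UNVOICED
    # Step 3: remaining frames are tested for stable voicing.
    if is_stable_voiced(frame):
        return CodingMode.STABLE_VOICED
    # Fallback: likely a voiced onset or rapidly evolving voiced speech.
    return CodingMode.GENERIC


if __name__ == "__main__":
    # Placeholder classifiers with fixed answers, purely to exercise the chain.
    mode = select_coding_mode(
        [0.1, -0.2, 0.3],
        is_active=lambda f: True,
        is_unvoiced=lambda f: False,
        is_stable_voiced=lambda f: False,
    )
    print(mode.name)
```

Because each step returns as soon as a class is identified, later classifiers only ever see frames that earlier steps did not claim, which mirrors the "classification chain ends" wording of the abstract.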
Publication number: MY134085(A) Publication date: 2007.11.30
Application number: MYPI20033873 Filing date: 2003.10.10
Applicant: NOKIA CORPORATION Inventor: MILAN JELINEK
Classification: G01L19/14;G10L19/14;G10L11/04;G10L19/00;G10L19/02;G10L21/02 Main classification: G01L19/14
Agency: Agent:
Main claim:
Address: