发明名称 Improvements in vocoders
摘要 <p>1,087,305. Vocoders. INTERNATIONAL BUSINESS MACHINES CORPORATION. May 20, 1965 [May 26, 1964], No. 21314/65. Heading H4R. In an analysis synthesis telephone system an excitation function is derived comprising a pulse sequence of a repetition frequency equal to the fundamental frequency of the speech input, which pulse sequence is used to control the sampling of the energy in the spectrum analysis bands. During unvoiced sounds, when the speech input is of such a character as to prevent the production of a satisfactory excitation function, an auxiliary pulse generator is switched in to control the sampling in the spectrum analysis bands. Fig. 3 shows an analyser in which a speech signal at 1 is applied to a low pass filter 2 and a zero crossing detector 3 to generate a train of pulses at the fundamental frequency of the speech input. These pulses are fed via an OR gate 17 to a sampling pulse generator 14 to generate pulses to open gates 8 1 to 8 n to feed samples from the outputs of the band pass filters 7 1 to 7 n , arranged to divide the frequency spectrum of the incoming speech into a number, n, of frequency bands, to the analog to digital converters 9 1 to 9 n , the outputs of which define the energy distribution across the frequency spectrum of the speech, i.e. the aggregate function. If the speech input is such that the zero crossing detector fails to produce pulses a single shot circuit, which is maintained in its metastable state by the pulses from the zero crossing detector, reverts to its stable state in 12 m. sees. to cause the pulse generator 16 to supply pulses, via OR gate 17, to the sampling generator 14 at 3 m. sec. intervals until pulses reappear at the output of the zero crossing detector. The pulse output from the zero crossing detector 3 is also applied to an " AND " gate 6 to feed the count of counter 5 to the output circuit 19, the counter being subsequently reset by the same pulse so that the counter output is a measure of the interval between pulses, the count being transmitted as the excitation function. The speech input at 1, via an amplifier 10 and level detector 11, keeps a single shot 12 in its metastable state to maintain gate 13 open so that if the speech level remains at zero for longer than 100 m. sees. the single shot 12 reverts to its stable state to inhibit the feed of sampling pulses from generator 14 to the gates 91 to 9 n . Voiced/unvoiced detector 18 described in detail with reference to Fig. 4 (not shown) comprises a logic circuit which produces an output, either when all the outputs of a number of the lowest frequency analysis filters are zero, or when the outputs of these filters is lower than that of groups of the same number of higher frequency analysis filters. Synthesiser circuit, Fig. 5.-Excitation function signals from the input-distributer circuit 26 are fed to a register 27 which sets counter 28 in accordance with the count corresponding to the interval between the excitation function pulses. The counter then counts, under the control of generator 29, down to zero when it generates a pulse to feed to distributer 35 via OR gate 36 before being reset in accordance with the following count in register 27. Meanwhile the aggregate function signals and voiced/ unvoiced signals are fed to the register 30, the voiced/unvoiced signals being used via a trigger circuit 37 to enable gate 38 to feed pulses from the auxiliary generator 39 to OR gate 36. and hence the distributer 35, in the absence of pulses from the counter 28 during unvoiced sounds. The digital to analog converter 31 converts the received aggregate function signals to analogue signals whose amplitudes vary according to the spectral energy distribution of the original speech, these signals being fed to the gates 32 1 to 32 n where they are gated appropriately by the pulses from the distributer 35. The resulting signals are passed through band-pass filters similar to the analysis filters in the analyser to an adding circuit 34 from which the synthesised speech is obtained. Since the mean frequency of the auxiliary generator 39, about 1000 c./s., is much higher than the fundamental speech frequency from counter 28, about 80 to 300 c./s., during unvoiced sounds the sample pulses from the digital to analog converter are narrowed in order that the energy intensity of the synthesised unvoiced sounds is made commensurate with that of the voiced sounds.</p>
申请公布号 GB1087305(A) 申请公布日期 1967.10.18
申请号 GB19650021314 申请日期 1965.05.20
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 LAMPARTER HELMUT;KNAUFT GUNTHER;SPRUTH WILHELM
分类号 G10L19/00;G10L19/02;G10L19/04;G10L25/93;H03K21/00;H04J3/17 主分类号 G10L19/00
代理机构 代理人
主权项
地址