发明名称 Speech capturing and speech rendering
摘要 The invention proposes extracting one or more speech signals (151-154) as well as one or more ambient signals (131) from sound signals captured by microphones, wherein each of the speech signals corresponds to a different speaker. The invention proposes to transmit both the one or more speech signals (151-154) and the one or more ambient signals (131) to a rendering side, as opposed to sending only speech signals. This enables to reproduce the speech and ambient signals in a spatially different way at the rendering side. By reproducing the ambient signals a feeling of “being together” is created. In an embodiment, the invention enables reproducing two or more speech signals spatially from each other and from the ambient signals so that speech intelligibility is increased despite the presence of the ambient signals.
申请公布号 US8781818(B2) 申请公布日期 2014.07.15
申请号 US200913141710 申请日期 2009.12.17
申请人 Koninklijke Philips N.V. 发明人 Janse Cornelis Pieter;Van Stuivenberg Leon C. A.;Belt Harm Jan Willem;Sarroukh Bahaa Eddine;Triki Mahdi
分类号 G10L19/00 主分类号 G10L19/00
代理机构 代理人
主权项 1. A speech capturing device-comprising: a capturing circuit, wherein the capturing circuit includes a plurality of microphones for capturing a plurality of sound signals originating from different spatial locations; one or more extracting circuits each for deriving a respective speech signal corresponding to a respective speaker from the plurality of the sound signals; a residual extracting circuit for deriving one or more ambient signals from the plurality of sound signals each decreased by the one or more speech signals derived by the one or more extracting circuits; and a transmitting circuit for transmitting the one or more speech signals and the one or more ambient signals, further comprising: an audiovisual locator for (i) determining one or more locations of the speakers and (ii) providing one or more output signals of spatial information about the locations of the speakers to the one or more extracting circuits, respectively, wherein each extracting circuit derives the respective speech signal further in response to a respective output signal of spatial information directed to a location of a respective one of the speakers.
地址 Eindhoven NL