Title Teleconferencing environment having auditory and visual cues
Abstract A teleconferencing environment is provided in which both audio and visual cues are used to identify active participants and presenters. Embodiments provide an artificial environment, configurable by each participant in a teleconference, that directs the attention of a user to an identifier of an active participant or presenter. This direction is provided, in part, by stereo-enhanced audio that is associated with a position of a visual identifier of an active participant or presenter that has been placed on a window of a computer screen. The direction is also provided, in part, by promotion and demotion of attendees between attendee, active participant, and current presenter and automatic placement of an image related to an attendee on the screen in response to such promotion and demotion.
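The promotion and demotion described in the abstract can be pictured as a small state model. The following Python sketch is illustrative only: the names `Role`, `promote`, and `demote` are hypothetical, and the assumption that an attendee moves one level at a time is not stated in this record.

```python
from enum import IntEnum


class Role(IntEnum):
    """The three attendee levels named in the abstract, lowest to highest."""
    ATTENDEE = 0
    ACTIVE_PARTICIPANT = 1
    CURRENT_PRESENTER = 2


def promote(role: Role) -> Role:
    """Move one level up; a current presenter stays a current presenter.

    Assumption: promotion happens one level at a time.
    """
    return Role(min(role + 1, Role.CURRENT_PRESENTER))


def demote(role: Role) -> Role:
    """Move one level down; a plain attendee stays a plain attendee.

    Assumption: demotion happens one level at a time.
    """
    return Role(max(role - 1, Role.ATTENDEE))
```

In a real client, a change of role would also trigger the automatic placement of the attendee's image that the abstract mentions; that step is left out here because the record does not describe how placement is computed.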
Publication No. US9445050(B2) Publication Date 2016.09.13
Application No. US201414543031 Filing Date 2014.11.17
Applicant Freescale Semiconductor, Inc. Inventors Travis Edward O.; Reber Douglas M.
Classification H04N7/14; H04N7/15; H04L29/06; G10L17/00 Main Classification H04N7/14
Agency Agent
Principal Claim 1. A teleconference system comprising:
  an attendee site node, coupled via a network to a teleconference server coupled to a plurality of remote attendee site nodes, and comprising
    a network interface configured to
      receive audio information from the teleconference server wherein the audio information corresponds to an audio signal originating from an originating attendee site node that is a member of the plurality of remote attendee site nodes,
      receive current speaker identification information from the teleconference server wherein the current speaker identification information is associated with the originating attendee site node,
      receive visual data from the teleconference server and
      receive gestural input information from the teleconference server, by the network interface, wherein
        the gestural input information is associated with the originating attendee site node, and
        the gestural input information corresponds to input data generated by a user of the originating attendee site node manipulating an input device over a location on an image of the visual data; and
    a processor coupled to the network interface and configured to
      associate the current speaker identification information with a first image stored in a memory coupled to the processor,
      display the first image at a first selected location in an attendee area on a display coupled to the processor,
      generate a modified audio signal from the audio information, and
      apply the modified audio signal to a playback device coupled to the processor, wherein the modified audio signal is generated to sound as if it is originating from a spatial location corresponding to the first selected location when emitted from the playback device,
      display the visual data in a slide area of a presenter area on the display wherein the presenter area is distinct from the attendee area,
      select an avatar image corresponding to the current speaker identification information wherein the avatar image is stored in the memory,
      display the avatar in a current presenter area of the presenter area, wherein the current presenter area of the presenter area is displayed next to the slide area of the presenter area,
      modify the avatar display in response to the gestural input information to provide a visual cue from the avatar to the corresponding location of the slide area in the presenter area.
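The claim requires a modified audio signal that sounds as if it originates from the on-screen location of the speaker's image. One common way to approximate this for a horizontal position is constant-power stereo panning. The sketch below assumes that technique; this record does not say which spatialization method the patent uses, and `pan_gains`, `spatialize`, and the pixel-based parameters are hypothetical names.

```python
import math


def pan_gains(x: float, width: float) -> tuple[float, float]:
    """Return (left_gain, right_gain) for an image centered at pixel x.

    Assumption: constant-power panning, where x = 0 is the left edge of
    the attendee area and x = width is the right edge.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    position = min(max(x / width, 0.0), 1.0)  # clamp to [0, 1]
    theta = position * math.pi / 2            # 0 = hard left, pi/2 = hard right
    return math.cos(theta), math.sin(theta)


def spatialize(mono: list[float], x: float, width: float) -> list[tuple[float, float]]:
    """Turn mono samples into (left, right) stereo pairs panned toward x."""
    left, right = pan_gains(x, width)
    return [(sample * left, sample * right) for sample in mono]
```

With this choice the squared gains always sum to one, so perceived loudness stays roughly constant as an image is moved across the window. A fuller implementation might also use interaural time differences or head-related transfer functions, which are outside this sketch.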
Address Austin TX US