Title of Invention Determining Which Participant is Speaking in a Videoconference
Abstract Aspects herein describe methods and systems of receiving, by one or more cameras, images in which the images comprise facial images of individuals. Aspects of the disclosure describe extracting the facial images from the images received, sorting the extracted facial images into separate groups, wherein each group corresponds to the facial images of each individual, and selecting, for each individual, a preferred facial image from each group. The preferred facial images selected are transmitted to a client for display. Aspects of the disclosure also describe selecting either a facial recognition algorithm or an audio triangulation algorithm to use to determine which individual is speaking, wherein the selection is based on whether lip movement of one or more of the individuals is visible in the images received from the cameras.
Publication No. US2015310260(A1) Publication Date 2015.10.29
Application No. US201514740498 Filing Date 2015.06.16
Applicant Citrix Systems, Inc. Inventor Summers Jacob Jared
Classification G06K9/00;H04N7/15 Main Classification G06K9/00
Agency Agent
Main Claim 1. A system comprising: one or more cameras; a device configured to be connected to the one or more cameras, the device comprising one or more processors and memory storing instructions that, when executed by one of the processors, cause the device to obtain, from at least one of the cameras, a set of images of a plurality of individuals at a location, select, from the set of images, for each individual, a preferred facial image for the individual, determine whether lip movement of one of the individuals is visible in the set of images, and select, based on whether lip movement of one of the individuals is visible in the set of images, either a facial recognition algorithm or an audio triangulation algorithm to determine which individual is speaking.
Address Fort Lauderdale FL US
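
The steps recited in the abstract and claim 1 (grouping facial images per individual, selecting a preferred image for each, and choosing a speaker-detection method based on lip-movement visibility) can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `FaceCrop`, `person_id`, `quality`, `lip_movement_visible`, `group_by_individual`, `select_preferred`, and `choose_method` are hypothetical, and the per-image values are assumed to come from upstream face and lip detectors that are not shown. The record does not state which visibility outcome maps to which algorithm; the sketch assumes visible lip movement leads to the image-based method and that audio triangulation is the fallback.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum


class SpeakerMethod(Enum):
    FACIAL_RECOGNITION = "facial_recognition"
    AUDIO_TRIANGULATION = "audio_triangulation"


@dataclass(frozen=True)
class FaceCrop:
    """One extracted facial image (all fields are hypothetical)."""
    person_id: str              # identity label assumed to come from a face matcher
    quality: float              # assumed quality score, higher is better
    lip_movement_visible: bool  # assumed output of a lip-movement detector


def group_by_individual(crops):
    """Sort extracted facial images into one group per individual."""
    groups = defaultdict(list)
    for crop in crops:
        groups[crop.person_id].append(crop)
    return dict(groups)


def select_preferred(groups):
    """Pick one preferred facial image per individual (here: highest quality)."""
    return {pid: max(faces, key=lambda f: f.quality) for pid, faces in groups.items()}


def choose_method(crops):
    """Choose the speaker-detection method from lip-movement visibility.

    Assumption: visible lip movement selects the image-based method,
    otherwise audio triangulation is used.
    """
    if any(crop.lip_movement_visible for crop in crops):
        return SpeakerMethod.FACIAL_RECOGNITION
    return SpeakerMethod.AUDIO_TRIANGULATION


if __name__ == "__main__":
    crops = [
        FaceCrop("alice", 0.4, False),
        FaceCrop("alice", 0.9, True),
        FaceCrop("bob", 0.7, False),
    ]
    preferred = select_preferred(group_by_individual(crops))
    print(sorted(preferred))     # individuals with a preferred image
    print(choose_method(crops))  # method chosen for this set of images
```

Using a single quality score to pick the preferred image is a simplification; the record does not say which criteria make a facial image preferred.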