发明名称 SYSTEM AND METHOD FOR LOCALIZING A TALKER USING AUDIO AND VIDEO INFORMATION
摘要 A videoconferencing endpoint includes at least one processor a number of microphones and at least one camera. The endpoint can receive audio information and visual motion information during a teleconferencing session. The audio information includes one or more angles with respect to the microphone from a location of a teleconferencing session. The audio information is evaluated automatically to determine at least one candidate angle corresponding to a possible location of an active talker. The candidate angle can be analyzed further with respect to the motion information to determine whether the candidate angle correctly corresponds to person who is speaking during the teleconferencing session.
申请公布号 US2017085837(A1) 申请公布日期 2017.03.23
申请号 US201615369576 申请日期 2016.12.05
申请人 Polycom, Inc. 发明人 Feng Jinwei
分类号 H04N7/15;G06T7/20;H04R1/40 主分类号 H04N7/15
代理机构 代理人
主权项
地址 San Jose CA US