Title of invention: COORDINATING AND MIXING AUDIOVISUAL CONTENT CAPTURED FROM GEOGRAPHICALLY DISTRIBUTED PERFORMERS
Abstract: Audiovisual performances, including vocal music, are captured and coordinated with those of other users in ways that create compelling user experiences. In some cases, the vocal performances of individual users are captured (together with performance synchronized video) on mobile devices, television-type display and/or set-top box equipment in the context of karaoke-style presentations of lyrics in correspondence with audible renderings of a backing track. Contributions of multiple vocalists are coordinated and mixed in a manner that selects, for visually prominent presentation, performance synchronized video of one or more of the contributors. Prominence of particular performance synchronized video may be based, at least in part, on computationally-defined audio features extracted from (or computed over) captured vocal audio. Over the course of a coordinated audiovisual performance timeline, these computationally-defined audio features are selective for performance synchronized video of one or more of the contributing vocalists.
Publication number: US2016057316(A1) Publication date: 2016.02.25
Application number: US201514928727 Filing date: 2015.10.30
Applicant: Smule, Inc. Inventors: Godfrey Mark T.;Cook Perry R.
Classification: H04N5/04;G10L13/033;G10H1/36;G10L21/013 Main classification: H04N5/04
Agency: Agent:
Main claim: 1. A method of preparing coordinated audiovisual performances from geographically distributed performer contributions, the method comprising: receiving via a communication network, a first audiovisual encoding of a first performer, including first performer vocals captured at a first remote device; receiving via the communication network, a second audiovisual encoding of a second performer, including second performer vocals captured at a second remote device; determining at least one time-varying, computationally-defined audio feature for the first performer vocals; determining at least one time-varying, computationally-defined audio feature for the second performer vocals; and based on comparison of the computationally-defined audio feature for first and second performer vocals, dynamically varying relative visual prominence of respective first and second performers throughout a combined audiovisual performance mix of the captured first and second performer vocals with the backing track.
Address: San Francisco CA US
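
Illustrative sketch (not part of the patent record): the main claim recites determining a time-varying audio feature for each performer's vocals and varying relative visual prominence based on a comparison of those features. The record does not name a particular feature or comparison rule, so the Python below assumes short-time RMS level as the feature and a smoothed level ratio as the comparison. The function names `rms_per_frame` and `prominence_timeline`, and the `frame_len` and `smoothing` parameters, are hypothetical and chosen only for illustration.

```python
import math


def rms_per_frame(samples, frame_len):
    """Short-time RMS level over non-overlapping frames of a mono signal.

    Assumption: RMS level stands in for the claim's "time-varying,
    computationally-defined audio feature"; the record names no feature.
    """
    levels = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        levels.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    return levels


def prominence_timeline(feat_a, feat_b, smoothing=0.5, eps=1e-9):
    """Per-frame visual weight in [0, 1] for performer A (B gets 1 - weight).

    Assumption: prominence is the first performer's share of the summed
    feature values, smoothed over time to avoid abrupt visual switches.
    """
    weights = []
    prev = 0.5  # start with both performers equally prominent
    for a, b in zip(feat_a, feat_b):
        # If both vocals are effectively silent, fall back to equal weight.
        raw = 0.5 if (a + b) < eps else a / (a + b)
        prev = smoothing * prev + (1.0 - smoothing) * raw
        weights.append(prev)
    return weights
```

Under these assumptions, a renderer could use each frame's weight to scale the first performer's video tile and `1 - weight` for the second, so the performer who is singing more strongly at a given moment is shown more prominently.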