发明名称 Digital media voice tags in social networks
摘要 A voice tagging system includes a client computing device that includes a media object capture device and a voice capture device and runs a client application that associates media objects to voice samples. The system also includes a communications network coupled to the client computing device, a voice tagging system coupled to the communications network and receiving at least one association between a first media object and a first voice sample, and a database coupled to the voice tagging system, the database including one or more voice tags, each voice tag being coupled to one or more voice samples.
申请公布号 US8903847(B2) 申请公布日期 2014.12.02
申请号 US201012718041 申请日期 2010.03.05
申请人 International Business Machines Corporation 发明人 Bailey Mark;Christensen James E.;Danis Catalina M.;Ellis Jason B.;Erickson Thomas D.;Farrell Robert G.;Kellogg Wendy A.
分类号 G06F7/00;G06F17/30;G10L15/00;G10L15/26;G10L15/10;G10L17/00;G06F15/16;H04M3/493 主分类号 G06F7/00
代理机构 Cantor Colburn LLP 代理人 Cantor Colburn LLP ;Young Preston
主权项 1. A system comprising: a client computing device, the client computing device including a media object capture device and a voice capture device and running a client application that associates media objects to voice samples; a communications network coupled to the client computing device; a voice tagging system coupled to the communications network and receiving at least one association between a first media object and a first voice sample, the voice tagging system receiving the first voice sample from an unidentified user and the voice tagging system being configured to identify the unidentified user that provided the first voice sample based on analysis speech components of the first voice sample; and a database coupled to the voice tagging system, the database including one or more existing voice tags, each voice tag being coupled to one or more existing voice samples and including at least one voice tag coupled to two different voice samples, wherein the system includes programming causing phoneme representations of the existing voice samples in the database to be compared to a phoneme representation of the first voice sample, wherein said comparing is based on an initial number of phonemes at the start of the first voice sample and the existing voice samples that is less than the total number of phonemes and includes sequentially comparing phonemes of a first existing voice sample to the phoneme representation of the first voice sample wherein earlier phonemes are weighted higher than later phonemes.
地址 Armonk NY US