Title: Systems, methods, and computer program products for providing contextually-aware video recommendation
Abstract: Methods, systems and computer program products are provided for providing content recommendation by obtaining metadata associated with a media object, extracting from the metadata a plurality of terms associated with the media object, and mapping at least a portion of the plurality of terms to buckets. A query vector having attributes corresponding to the buckets is used to perform a query on a database storing media object documents having attributes corresponding to the buckets.
Publication No.: US9451329(B2) Publication date: 2016.09.20
Application No.: US201414509396 Filing date: 2014.10.08
Applicant: SPOTIFY AB Inventors: Whitman Brian; Rodger David
Classification: H04N7/10;H04N7/025;H04N21/482;H04N21/462;H04N21/442;H04N21/435;H04N21/432;G06Q30/02;G06F17/30 Main classification: H04N7/10
Agency: Fitzpatrick, Cella, Harper & Scinto Agent: Fitzpatrick, Cella, Harper & Scinto
Principal claim: 1. A method of providing a database of content-level attributes associated with media objects, the method comprising: performing on at least one computer of a content recommendation system, the at least one computer having a query interface for receiving a query containing at least one content-level attribute, the steps of: obtaining, by crawling a network using a server adapted to gather text data, metadata associated with a media object from a plurality of data sources, wherein at least one of the plurality of data sources is an unstructured data source and the metadata includes extrinsic metadata corresponding to one or more content-level attributes of the media object; extracting from the metadata a plurality of terms associated with the media object by applying an entity extraction model to the metadata; mapping at least a portion of the plurality of terms to a plurality of buckets using an indexing engine in combination with a clustering framework configured to cluster the plurality of terms to categorization terms associated with each bucket; calculating, for each term of the plurality of terms, a probability that the term is associated with the media object; associating the probability to each term, correspondingly; generating a vector of content-level attributes corresponding to the media object based on the associating; storing the vector of content-level attributes in a database; receiving via the query interface a query vector containing at least one of the content-level attributes; searching the database for at least one content-level attribute contained in the query vector; and providing, in response to the searching, a query result containing the media object.
Address: Stockholm SE
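
Illustrative sketch (not part of the patent record): the steps recited in the principal claim, from extracting terms through answering a query vector, might be pictured roughly as follows. Everything named here is an assumption made for illustration: the `BUCKETS` vocabulary, the function names, and the sample texts are hypothetical. A plain word split stands in for the entity extraction model, a dictionary lookup stands in for the indexing engine and clustering framework, relative frequency stands in for the per-term probability, and an in-memory dict stands in for the database. Crawling is omitted.

```python
from collections import defaultdict

# Hypothetical buckets and their categorization terms (illustrative only).
BUCKETS = {
    "genre": {"comedy", "drama", "documentary"},
    "mood": {"dark", "uplifting", "funny"},
}


def extract_terms(texts):
    """Stand-in for an entity extraction model: split on whitespace,
    strip surrounding punctuation, and lowercase."""
    terms = []
    for text in texts:
        terms.extend(word.strip(".,!?;:").lower() for word in text.split())
    return [t for t in terms if t]


def build_vector(texts):
    """Map extracted terms to buckets and attach a probability to each.

    Assumption: the probability is the term's relative frequency among
    all bucketed terms found in the metadata for this media object.
    """
    counts = defaultdict(int)
    for term in extract_terms(texts):
        for bucket, vocabulary in BUCKETS.items():
            if term in vocabulary:
                counts[(bucket, term)] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {attribute: n / total for attribute, n in counts.items()}


def search(database, query_vector):
    """Score each stored vector by its dot product with the query vector
    and return matching media object ids, best match first."""
    scored = []
    for media_id, vector in database.items():
        score = sum(
            weight * vector.get(attribute, 0.0)
            for attribute, weight in query_vector.items()
        )
        if score > 0:
            scored.append((media_id, score))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [media_id for media_id, _ in scored]


# Store one attribute vector per media object (in-memory "database").
database = {
    "video_a": build_vector(["A dark drama.", "Dark and slow drama"]),
    "video_b": build_vector(["Funny comedy, very funny!"]),
}

# Query for media objects associated with the attribute mood=dark.
results = search(database, {("mood", "dark"): 1.0})
print(results)
```

Keying each attribute by a (bucket, term) pair keeps the query vector and the stored vectors in the same attribute space, which is the property the abstract relies on; a production system would presumably use a real search index rather than a linear scan.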