摘要 |
A method of recognizing artist ambiguity is performed at a server system having one or more processors and memory storing one or more programs for execution by the one or more processors. The method includes generating a feature vector that represents a first artist identifier of a plurality of artist identifiers in a first dataset. The feature vector includes a first feature indicating whether the first artist identifier matches multiple artist entries in one or more second datasets that are distinct from the first dataset. The method also includes determining, based at least in part on the first feature of the feature vector, a probability that the first artist identifier is associated with two or more different real-world artists, and providing a report that specifies the first artist identifier as potentially ambiguous in accordance with a determination that the probability satisfies a predetermined condition. |