摘要 |
Various of the disclosed embodiments contemplate television viewing behavior data collected from a plurality of television set top boxes by using aggregation to detect an excess or a deficit in viewership for a group of television set top boxes. In some embodiments, a group of set top boxes can be associated with a particular television service provider, cable television head-end, or data warehouse. Additionally, some embodiments can clean television viewing behavior data by detecting and correcting aberrant viewership in a time series, e.g. based on a weekly or an approximately monthly frequency. In some embodiments, the aberrant viewership can be detected by calculating a minimum expected number of viewers for a day and comparing it to the actual number of households that reported viewers for that day. |