摘要 |
A method cleans television viewing behavior data collected from a plurality of television set top boxes by using aggregation to detect an excess or a deficit in viewership for a group of television set top boxes. In various aspects, the group of set top boxes may be associated with a particular television service provider, cable television head-end, or data warehouse. Additionally, the method can clean television viewing behavior data by detecting and correcting aberrant viewership in a time series, that is based on a weekly or an approximately monthly frequency. The aberrant viewership can be detected by calculating a minimum expected number of viewers for a day, and comparing it to the actual number of households that reported viewers for that day.
|