摘要 |
Systems, methods, and computer storage media having computer-executable instructions embodied thereon that utilize images to generate identifiers of entities and to determine various relationships between entities and web pages are described. A collection of class images that represent various people and non-people entities deemed relevant for tracking user behavior is generated. Given a class image that represents an entity, an image identifier comprising unique feature vectors for that class image is generated. Given an arbitrary web page, all the images embedded on the web page are analyzed. Each embedded image is compared to the image identifiers of the collection of class images to determine whether or not any entities of interest appear in the embedded images of the web page. If relevant entities do appear on the web page, various associations can be made between the entities found on the page, and between the uniform resource locator (URL) of the web page where the entities appear.
|