Main claim |
1. A method for automatic generation of a word-cloud for a content item, comprising:
extracting terms from a content item, belonging to a specific domain including a plurality of content items, using statistical selection criteria;
providing a folksonomy of terms previously used by users to tag content items, including items belonging to an external domain and not belonging to the specific domain;
weighting a plurality of the extracted terms by a tag probability, determined from the provided folksonomy, that the term is used as a tag; and
generating a visual representation of the extracted terms in which each of the extracted terms is displayed with a representation weight calculated from a statistical weight of the extracted term, boosted by the weighting tag probability of the extracted term;
wherein said steps are implemented in either:
computer hardware configured to perform said steps, or computer software embodied in a non-transitory, tangible, computer-readable storage medium,
wherein providing the folksonomy comprises providing a folksonomy of tags suggested by people and wherein weighting the plurality of extracted terms comprises weighting by a probability, determined from the provided folksonomy, that the term is used as a tag by people. |
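
The claimed steps can be illustrated with a minimal Python sketch. The claim does not fix any particular formula, so everything concrete here is an assumption made for illustration: TF-IDF stands in for the "statistical selection criteria", the tag probability is estimated as a term's relative frequency among all tag assignments in the folksonomy, the boost is the multiplicative factor `1 + p`, and the visual representation is reduced to a font size per term. All function and parameter names are hypothetical.

```python
import math
from collections import Counter


def statistical_weights(doc_tokens, domain_docs):
    """Assumed statistical weight: TF-IDF of each term of the content item,
    computed against the other content items of the same domain.
    `domain_docs` is a list of sets of terms, one set per content item."""
    tf = Counter(doc_tokens)
    n_docs = len(domain_docs)
    weights = {}
    for term, count in tf.items():
        df = sum(1 for doc in domain_docs if term in doc)
        idf = math.log((1 + n_docs) / (1 + df)) + 1.0  # smoothed IDF
        weights[term] = (count / len(doc_tokens)) * idf
    return weights


def tag_probabilities(folksonomy_tags):
    """Assumed tag probability: relative frequency of each term among all
    tag assignments made by people (may come from an external domain)."""
    counts = Counter(folksonomy_tags)
    total = sum(counts.values())
    return {tag: c / total for tag, c in counts.items()}


def representation_weights(stat_weights, tag_prob, top_k=10):
    """Boost each statistical weight by the term's tag probability
    (assumed boost: multiply by 1 + p) and keep the top_k terms."""
    boosted = {t: w * (1.0 + tag_prob.get(t, 0.0))
               for t, w in stat_weights.items()}
    top = sorted(boosted.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    return dict(top)


def font_sizes(weights, min_pt=10, max_pt=40):
    """Map representation weights linearly onto font sizes for display."""
    if not weights:
        return {}
    lo, hi = min(weights.values()), max(weights.values())
    span = (hi - lo) or 1.0  # avoid division by zero when all weights match
    return {t: round(min_pt + (w - lo) / span * (max_pt - min_pt))
            for t, w in weights.items()}


if __name__ == "__main__":
    domain_docs = [{"python", "code", "the"}, {"the", "music"}, {"the", "code"}]
    doc = ["python", "python", "code", "the"]
    folksonomy = ["code", "code", "code", "music", "python"]

    stat = statistical_weights(doc, domain_docs)
    boosted = representation_weights(stat, tag_probabilities(folksonomy))
    print(font_sizes(boosted))
```

In this sketch a term that never appears as a tag keeps its statistical weight unchanged, while terms people actually use as tags are raised in proportion to how often they are used, which is one plausible reading of "boosted by the weighting tag probability".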