主权项 |
1. A computer-implemented method for discovering local content publishers and what is locally interesting to a local audience and subsequently generating, on an ongoing basis, streams of locally relevant content, comprising:
generating, via one or more server machines, an initial set of local terms for a place, the place being a geographic area, the generating comprising:
utilizing a geographic database, the geographic database containing geographic information and spatial data indicative of a geographical hierarchy; andidentifying the initial set of local terms using the geographical hierarchy, the initial set of local terms being unique to the place; building a local content corpus for the place utilizing the initial set of local terms, the local content corpus containing documents that are semantically related to each other with respect to the place, the documents contained in the local content corpus identified via a search using the initial set of local terms; utilizing the local content corpus, determining an importance of each of the initial set of local terms; populating an index, via an indexing engine, the index configured for assigning an importance ranking to the documents contained in the local content corpus, the importance ranking calculated by: generating an algebraic model for each document contained in the local content corpus, the algebraic model configured to represent each document as a plurality of vectors of index terms such that each vector of the plurality of vectors corresponds to a separate term of the initial set of local terms and wherein, in an instance in which the separate terms appears in a documents, the corresponding value in the vector corresponding to the separate term is non-zero; and in response to a query to the index, the query comprised of one or more search parameters, outputting a predetermined portion of the index matching the search parameters. |