Abstract
One embodiment of the present invention provides a system for estimating a similarity level between documents. During operation, the system receives a number of documents, extracts a number of semantic entities from the documents, calculates the similarity level between the documents based on the extracted semantic entities, and produces a result indicating similar documents based on the calculated similarity level.
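
The four operations named above (receive, extract, calculate, produce) can be illustrated with a minimal sketch. The abstract does not specify how semantic entities are extracted or how the similarity level is computed, so everything here is an assumption made for illustration: `extract_entities` treats capitalized tokens as a crude stand-in for semantic entities, `jaccard` uses set overlap as one possible similarity measure, and the `threshold` parameter and all function names are hypothetical.

```python
import re
from itertools import combinations


def extract_entities(text):
    """Hypothetical extractor: capitalized tokens stand in for semantic entities."""
    return {m.lower() for m in re.findall(r"\b[A-Z][a-zA-Z]+\b", text)}


def jaccard(a, b):
    """Set-overlap similarity in [0, 1]; an assumed measure, not the claimed one."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similar_documents(docs, threshold=0.5):
    """Return (name_x, name_y, score) for document pairs scoring at or above threshold.

    `docs` maps a document name to its text. The threshold is an illustrative choice.
    """
    # Step 1 and 2: receive the documents and extract entities from each.
    entities = {name: extract_entities(text) for name, text in docs.items()}
    results = []
    # Step 3: calculate a similarity level for every pair of documents.
    for x, y in combinations(sorted(entities), 2):
        score = jaccard(entities[x], entities[y])
        # Step 4: produce a result indicating which documents are similar.
        if score >= threshold:
            results.append((x, y, score))
    return results
```

A real system would likely replace the extractor with a named-entity recognizer and might weight entities by type or frequency; the pairwise structure of the comparison would stay the same.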