发明名称 EXTRACTION OF CERTAIN TYPES OF ENTITIES
摘要 Certain types of entities may be extracted from a document. In one example, the entities to be recognized are cultural entities, such as the names of movies, video games, books, etc. For each such entity, a concept graph may be built that shows the relationship between the entity itself and other entities, such as the relationship between a movie and the actor(s) who act in the movie. When a candidate entity name is detected in the document, the concept graph may be used to look for other entities that appear in the context of the candidate entity. The presence of related entities in the context of the candidate may be used to disambiguate the meaning of the candidate. For example, a common word like “up” might be recognized as the name of a movie if the names of actors or characters in that movie appear near the word “up”.
申请公布号 US2011131244(A1) 申请公布日期 2011.06.02
申请号 US20090626905 申请日期 2009.11.29
申请人 MICROSOFT CORPORATION 发明人 PADOVITZ AMIR J.;HURST MATTHEW F.
分类号 G06F17/30;G06F15/18 主分类号 G06F17/30
代理机构 代理人
主权项
地址