发明名称 EXTENDING A SEED LIST TO SUPPORT METADATA MAPPING
摘要 Embodiments of the present invention address deficiencies of the art in respect to crawling content and provide a method, system and computer program product for metadata processing for seed lists for structured content sources. In one embodiment, a method for processing metadata for a seed list can include extracting metadata from a seed list for application content, storing the metadata in a repository, associating the metadata with fields of the application content, crawling the fields of the application content by reference to the metadata, and indexing the fields. In an aspect of the embodiment, the method further can include annotating the application to produce metadata for the fields of the application content. In yet another aspect of the embodiment, the method can include mapping the metadata to a document schema generic to a plurality of heterogeneous application content.
申请公布号 US2009006364(A1) 申请公布日期 2009.01.01
申请号 US20070770419 申请日期 2007.06.28
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 KONOPNICKI DAVID;HASSON LAURENT D.
分类号 G06F7/06;G06F17/30 主分类号 G06F7/06
代理机构 代理人
主权项
地址