Abstract |
The present invention is directed to a system and method that lets users identify database elements in a web page, store an extraction template representing the location and type of those elements on the page, extract the product record and store it in their collection, use the extraction template to automatically extract all the data from the web site, and continually check the extraction templates for correctness, updating them when necessary. Additionally, the system provides crowd-sourced creation of web page data record extraction templates to build a database of templates that others can then use to extract information from web pages at the site where the templates were created and to save that information to a social network. Moreover, the crowd-based extraction template creation and storage system can be used to create extraction templates for batch extraction of information from remote web sites. Also, the data record information extracted from a web page can be used to find the same or similar products at other web sites in a central product record database that is created with the previously mentioned batch extraction system. |
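
As an illustration only, the idea of an extraction template that records the location and type of each element, and that can be re-checked for correctness, might be sketched as follows. The abstract does not specify a template format, so `FieldRule`, `extract`, and `template_is_valid` are hypothetical names, and locating an element by tag plus class attribute is an assumption made for this sketch. It also assumes a matched element contains no nested element of the same tag.

```python
from dataclasses import dataclass
from html.parser import HTMLParser


@dataclass(frozen=True)
class FieldRule:
    """Hypothetical template entry: where a field sits on the page and its type."""
    name: str       # field name in the extracted product record
    tag: str        # HTML tag that holds the value (assumed locator)
    css_class: str  # class attribute identifying the element (assumed locator)
    kind: type      # Python type used to convert the element text


class _TemplateParser(HTMLParser):
    """Collects the text of the first element matching each rule."""

    def __init__(self, rules):
        super().__init__()
        self._rules = rules
        self._active = None  # rule whose element is currently open
        self.raw = {}

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        for rule in self._rules:
            if (rule.tag == tag and rule.css_class in classes
                    and rule.name not in self.raw):
                self._active = rule
                self.raw[rule.name] = ""
                break

    def handle_endtag(self, tag):
        if self._active is not None and tag == self._active.tag:
            self._active = None

    def handle_data(self, data):
        if self._active is not None:
            self.raw[self._active.name] += data


def extract(template, html):
    """Apply a template to a page and return the typed data record."""
    parser = _TemplateParser(template)
    parser.feed(html)
    parser.close()
    return {
        rule.name: rule.kind(parser.raw[rule.name].strip())
        for rule in template
        if rule.name in parser.raw
    }


def template_is_valid(template, html):
    """A template is treated as stale if any field is missing or fails to convert."""
    try:
        record = extract(template, html)
    except ValueError:
        return False
    return all(rule.name in record for rule in template)


# Made-up page and template, used only to exercise the sketch.
PAGE = '<div><h1 class="title">Blue Kettle</h1><span class="price">19.99</span></div>'
TEMPLATE = [
    FieldRule("title", "h1", "title", str),
    FieldRule("price", "span", "price", float),
]

print(extract(TEMPLATE, PAGE))
print(template_is_valid(TEMPLATE, PAGE))
```

In this sketch a template that stops matching, for example after a site redesign, makes `template_is_valid` return `False`, which is the point at which a template would need to be updated.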