摘要 |
A system provides an interface for a client application to programmatically access a Web site. A simple data extractor returns a retrieved Web page in a well-formed syntax (such as XHTML) to the client application. Web forms crawled or otherwise retrieved from the Internet are translated into service interfaces, or APIs. The service interfaces are expressed in a simple object access protocol (SOAP), and are registered in a universal description, discovery, and integration (UDDI) registry. For each Web form analyzed, the present system automatically generates a program code that receives SOAP requests and translates them into regular Web accesses, thus providing applications and application developers with an easy, programmatic interface to Web sites such as search engines and other databases. The Web pages are first processed by a component that analyzes the forms and generates a service description (SD) file for each form. The SD file contains all the information necessary for producing the ultimate output of the system: API descriptions in the form of WSDL (Web Services Description Language) files, WDS (Well-Defined Service) files, and ISD (Interface Service Deployment) files. The SD file is also used for generating executable program code such as the Java language that implements the service. The executable code is then compiled, deployed in a SOAP service manager using the ISD file, and finally published in a UDDI registry using the WSDL and WDS files.
|