发明名称 METHOD AND DEVICE FOR STRUCTURING QUERY AND INTERPRETATION OF SEMISTRUCTURED INFORMATION
摘要 PROBLEM TO BE SOLVED: To refer to semistructured information at web sites in various parts to structure it. SOLUTION: This method comprises the steps of: identifying patterns of interest by examining the semistructured information including text information using lexical analysis for repetitive patterns, cataloging the patterns by name and position in a nested structure, examining patterns in the nested structure to identify attributes that correspond to fields of a relational schema of a relational database (S306); examining the patterns in the nested structure to identify the patterns, decomposing the patterns to catalog them in the nested structure, examining the patterns in the nested structure to identify links to other semistructured information (S308); and cataloging the patterns of interest in the nested structure, repeating the above steps until all of the nested information is cataloged to obtain definition including regular expressions of the semistructured information so that it may be utilized by a dedicated program translator. COPYRIGHT: (C)2008,JPO&INPIT
申请公布号 JP2008123547(A) 申请公布日期 2008.05.29
申请号 JP20080007850 申请日期 2008.01.17
申请人 AMAZON.COM INC 发明人 ASHISH KAPTA;BENKII HARINARIYAN;DARAN KUASS;RAJARAMAN ANAND
分类号 G06F13/00;G06F17/30;G06F12/00 主分类号 G06F13/00
代理机构 代理人
主权项
地址