Title of invention Extraction of information from structured documents
Abstract A method of extracting information from a structured document includes the steps of: assigning a partial tree identifier, inclusive of a tag identifier, to a selected partial tree, wherein the tag identifier includes the name of the tag corresponding to the root of the selected partial tree, the name of at least one format attribute of the tag, and the value of that format attribute; arranging the names of the format attributes in a predetermined order in the tag identifier if the tag has two or more format attributes; and identifying, from a list of partial tree identifiers of the partial trees that exist in the structured document after it has been updated, a partial tree whose partial tree identifier is identical to that of the selected partial tree.
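The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation; several details are assumptions made only for illustration: the set `FORMAT_ATTRS` of attributes treated as format attributes, alphabetical order as the "predetermined order", the bracketed string layout of the tag identifier, and a partial tree identifier built as the path of tag identifiers from the document root to the root of the partial tree. All function names are hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumption: the abstract does not say which attributes are "format attributes".
FORMAT_ATTRS = {"align", "bgcolor", "border", "class", "width"}


def tag_identifier(elem):
    """Tag name plus format attribute names and values in a fixed order.

    Assumption: alphabetical order stands in for the "predetermined order".
    """
    attrs = sorted((k, v) for k, v in elem.attrib.items() if k in FORMAT_ATTRS)
    return elem.tag + "".join(f"[{k}={v}]" for k, v in attrs)


def list_partial_tree_identifiers(root):
    """Return (identifier, element) pairs for every partial tree under root.

    Assumption: a partial tree identifier is the tuple of tag identifiers
    on the path from the document root to the root of the partial tree.
    """
    pairs = []

    def walk(elem, prefix):
        ident = prefix + (tag_identifier(elem),)
        pairs.append((ident, elem))
        for child in elem:
            walk(child, ident)

    walk(root, ())
    return pairs


def find_matching(root, wanted):
    """Partial trees in the (updated) document whose identifier equals wanted."""
    return [e for ident, e in list_partial_tree_identifiers(root) if ident == wanted]


# Document before the update: the user selects the <table> partial tree.
old = ET.fromstring(
    '<html><body><table width="90" border="1">'
    "<tr><td>price: 100</td></tr></table></body></html>"
)
# Document after the update: content changed, attribute order differs.
new = ET.fromstring(
    '<html><body><p>ad</p><table border="1" width="90">'
    "<tr><td>price: 120</td></tr></table></body></html>"
)

selected = old.find("./body/table")
wanted = next(i for i, e in list_partial_tree_identifiers(old) if e is selected)
matches = find_matching(new, wanted)
print(wanted)
print(matches[0].find("./tr/td").text)
```

Because the attribute names are sorted before the tag identifier is built, the two `<table>` tags yield the same identifier even though their attributes are written in a different order, which is the reason the abstract gives for fixing the order.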
Publication number EP1376408(A2) Publication date 2004.01.02
Application number EP20030253818 Filing date 2003.06.17
Applicant NIPPON TELEGRAPH AND TELEPHONE CORPORATION Inventors UCHIYAMA, TADASU;MIYAMOTO, MASARU
Classification G06F17/21;G06F17/22;G06F17/30;(IPC1-7):G06F17/30 Main classification G06F17/21
Agency Agent
Main claim
Address