发明名称 METHOD AND DEVICE USED FOR EXTRACTING WEBPAGE CONTENT
摘要 <p><P>PROBLEM TO BE SOLVED: To provide a method and device used for extracting webpage contents, which obtain further optimal webpage extraction results. <P>SOLUTION: The method and device used for extracting webpage contents are disclosed. This method includes: extracting the webpage contents for webpage input based on the digital document analysis (DDA) method and creating the DDA extraction result; extracting the web page contents for web page input based on the document image resolution (DIR) method and creating the DIR extraction result; and combining the DDA extraction result and the DIR extraction result and creating the combination result. <P>COPYRIGHT: (C)2009,JPO&INPIT</p>
申请公布号 JP2009193571(A) 申请公布日期 2009.08.27
申请号 JP20080324056 申请日期 2008.12.19
申请人 RICOH CO LTD 发明人 DU CHENG
分类号 G06F17/21 主分类号 G06F17/21
代理机构 代理人
主权项
地址