发明名称 CONTENT RETRIEVAL DEVICE, METHOD, AND PROGRAM
摘要 <P>PROBLEM TO BE SOLVED: To associate a specific position of a document with content at low cost without setting any object area with which the content is associated. <P>SOLUTION: A content retrieval device of this present invention is configured to extract a character block from a document and to associate the character block, and a page identifier and coordinates within a page in the document where the character block appears, to output to an index DB. The index DB is searched based on a query character block extracted from an inputted search query (a partial area in the document) to tabulate retrieval results for every page, and a page where the largest number of character blocks have been retrieved is defined as a hit page. The center of gravity of the coordinates within the page of the character block retrieved from the hit page is calculated and defined as a hit position within the page. The calculated hit page and the hit position within the page are defined as a query, and content with which the neighboring page position of the hit position within the page is associated is retrieved from the content DB. <P>COPYRIGHT: (C)2012,JPO&INPIT
申请公布号 JP2012003356(A) 申请公布日期 2012.01.05
申请号 JP20100135606 申请日期 2010.06.14
申请人 NIPPON TELEGR & TELEPH CORP <NTT> 发明人 FUJIMURA TAKASHI;MIYATA AKIHIRO;SHIOBARA TOSHIKO
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址