发明名称 Methods and systems for analyzing data in media material having layout
摘要 The present invention relates to systems and methods for analyzing media material having a layout. A media material analyzer includes a segmenter and an article composer. The segmenter identifies block segments associated with columnar body text in the media material. The article composer determines which of the identified block segments belong to one or more articles in the media material. The article composer can determine whether candidate block segments belong to a same article based on language statistics information, layout transition information, or both language statistics information and layout transition information. A system for searching media material having a layout over a network is also provided.
申请公布号 US2008107337(A1) 申请公布日期 2008.05.08
申请号 US20060592268 申请日期 2006.11.03
申请人 GOOGLE INC. 发明人 FURMANIAK RALPH;SMITH RAY;VINCENT LUC;BLOOMBERG DAN;LEE DAR-SHYANG
分类号 G06K9/34 主分类号 G06K9/34
代理机构 代理人
主权项
地址