MEDIA MATERIAL ANALYSIS OF CONTINUING ARTICLE PORTIONS
Abstract
The present invention relates to systems and methods for analyzing media material having articles continuing across multiple pages. A media material analyzer includes a segmenter and an article composer. The segmenter identifies block segments associated with columnar body text in the media material. The article composer determines which of the identified block segments belong to a continuing article extending across multiple pages in the media material based on language statistics information and continuation transition information.
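The pairing of a continuation cue with a text-similarity check can be illustrated with a minimal sketch. It is not the method claimed in the application. Every name here (`Block`, `jump_target`, `vocabulary_overlap`, `find_continuation`) is hypothetical, the "continued on page N" cue format is an assumption standing in for continuation transition information, and plain word overlap is an assumed stand-in for language statistics information.

```python
import re
from dataclasses import dataclass


@dataclass
class Block:
    """A block segment of columnar body text (hypothetical structure)."""
    page: int
    text: str


def jump_target(block):
    """Return the page named by a 'continued on page N' cue, or None.

    The cue format is an assumption made for this sketch.
    """
    match = re.search(r"continued on page (\d+)", block.text, re.IGNORECASE)
    return int(match.group(1)) if match else None


def vocabulary_overlap(a, b):
    """Toy stand-in for language statistics: Jaccard overlap of word sets."""
    words_a = set(re.findall(r"[a-z]+", a.text.lower()))
    words_b = set(re.findall(r"[a-z]+", b.text.lower()))
    union = words_a | words_b
    return len(words_a & words_b) / len(union) if union else 0.0


def find_continuation(start, candidates):
    """Pick the block on the cued page whose vocabulary best matches `start`."""
    target = jump_target(start)
    if target is None:
        return None
    on_page = [c for c in candidates if c.page == target]
    return max(on_page, key=lambda c: vocabulary_overlap(start, c), default=None)
```

For example, given a page 1 block ending in "Continued on page 5", `find_continuation` considers only the blocks on page 5 and returns the one sharing the most vocabulary with the opening block.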
Application Publication Number
WO2008057473(A3)
Application Publication Date
2008.07.24
Application Number
WO2007US23233
Filing Date
2007.11.05
Applicants
GOOGLE INC.;FURMANIAK, RALPH;SMITH, RAY;VINCENT, LUC;BLOOMBERG, DAN
Inventors
FURMANIAK, RALPH;SMITH, RAY;VINCENT, LUC;BLOOMBERG, DAN