The present invention provides a system that detects similar web page elements described in a mark-up language, so that the content of those elements can be captured. The captured text content may then be sent to a text classifier for further analysis.
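The abstract does not state how similarity between elements is determined. As an illustration only, the following minimal sketch assumes that two elements are "similar" when they share the same path of tag names and `class` attributes from the document root. The names `SimilarElementCollector`, `capture_similar_text` and `min_count` are hypothetical and do not come from the patent.

```python
from collections import defaultdict
from html.parser import HTMLParser

# Elements that never have a closing tag; they are not pushed on the stack.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class SimilarElementCollector(HTMLParser):
    """Groups text content by structural signature (assumed similarity rule)."""

    def __init__(self):
        super().__init__()
        self._stack = []                  # open elements as (tag, class) pairs
        self.groups = defaultdict(list)   # signature -> captured text snippets

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        css_class = dict(attrs).get("class") or ""
        self._stack.append((tag, css_class))

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, tolerating sloppy markup.
        for i in range(len(self._stack) - 1, -1, -1):
            if self._stack[i][0] == tag:
                del self._stack[i:]
                break

    def handle_data(self, data):
        text = data.strip()
        if text and self._stack:
            self.groups[tuple(self._stack)].append(text)


def capture_similar_text(html, min_count=2):
    """Return the text of each group of at least min_count similar elements."""
    collector = SimilarElementCollector()
    collector.feed(html)
    collector.close()
    return [texts for texts in collector.groups.values() if len(texts) >= min_count]


page = """
<html><body>
<h1>Forum</h1>
<div class="post"><p>First comment</p></div>
<div class="post"><p>Second comment</p></div>
<div class="ad"><p>Buy now</p></div>
</body></html>
"""
print(capture_similar_text(page))  # [['First comment', 'Second comment']]
```

Each returned group could then be handed to a text classifier of the reader's choice; the classifier itself is outside the scope of this sketch.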
Publication number
US2013019163(A1)
Publication date
2013.01.17
Application number
US201113637483
Filing date
2011.03.28
Applicant
BRITISH TELECOMMUNICATIONS PUBLIC LIMITED COMPANY;THOMPSON SIMON G;NGUYEN DUONG T;THINT MARCUS ALFRED;GHARIB HAMID
Inventor
THOMPSON SIMON G;NGUYEN DUONG T;THINT MARCUS ALFRED;GHARIB HAMID