SYSTEM AND METHOD FOR DOCUMENT ANALYSIS, PROCESSING AND INFORMATION EXTRACTION
摘要
The present invention is directed to a method and computer system for representing a dataset comprising N documents by computing a diffusion geometry of the dataset comprising at least a plurality of diffusion coordinates. The present method and system stores a number of diffusion coordinates, wherein the number is linear in proportion to N.
申请公布号
EP1782278(A4)
申请公布日期
2012.07.04
申请号
EP20050763161
申请日期
2005.06.23
申请人
PLAIN SIGHT SYSTEMS, INC.
发明人
COIFMAN, RONALD, R.;COPPI, ANDREAS, C.;GESHWIND, FRANK;LAFON, STEPHANE, S.;LEE, ANN, B.;MAGGIONI, MAURO, M.;WARNER, FREDERICK, J.;ZUCKER, STEVEN;FATELEY, WILLIAM, G.