Efficient indexing of documents with similar content, Application No. US20060419423

Title of invention	Efficient indexing of documents with similar content
Abstract	A set of documents may be stored and indexed as a compressed sequence of tokens. The documents are grouped into clusters. The sequences of tokens representing the clusters are encoded to elide some repeating instances of tokens. A compressed sequence of tokens is then generated from the compressed cluster sequences. Queries on the compressed sequence are performed by first identifying the cluster sequences within it that are likely to contain documents satisfying the query, and then identifying, within those clusters, the documents that actually satisfy the query.
Publication No.	US8175875(B1)	Publication date	2012.05.08
Application No.	US20060419423	Filing date	2006.05.19
Applicant	DEAN JEFFREY A.;GHEMAWAT SANJAY;THAMBIDORAI GAUTHAM;GOOGLE INC.	Inventor	DEAN JEFFREY A.;GHEMAWAT SANJAY;THAMBIDORAI GAUTHAM
Classification	G10L15/06	Main classification	G10L15/06
Agency		Agent
Main claim
Address
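
The two-phase lookup described in the abstract can be illustrated with a minimal Python sketch. It is not the patented encoding: the abstract does not say how clusters are formed or how repeated tokens are elided, so this sketch assumes the clusters are supplied by the caller and uses a simple per-cluster dictionary in which each distinct token is stored once and documents refer to tokens by index. All names (EncodedCluster, encode_cluster, build_index, search) and the sample data are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class EncodedCluster:
    doc_ids: list          # ids of the documents in this cluster
    tokens: list           # each distinct token stored once (repeats elided)
    doc_token_refs: list   # per document: indices into `tokens`


def encode_cluster(doc_ids, docs):
    """Encode one cluster so that every distinct token is stored only once.

    Assumption: dictionary-style coding stands in for the elision scheme,
    which the abstract does not specify.
    """
    tokens, index_of, refs = [], {}, []
    for doc_id in doc_ids:
        doc_refs = []
        for tok in docs[doc_id]:
            if tok not in index_of:
                index_of[tok] = len(tokens)
                tokens.append(tok)
            doc_refs.append(index_of[tok])
        refs.append(doc_refs)
    return EncodedCluster(list(doc_ids), tokens, refs)


def build_index(clusters, docs):
    """Build the compressed index as a list of encoded clusters."""
    return [encode_cluster(doc_ids, docs) for doc_ids in clusters]


def search(index, query_terms):
    """Return sorted ids of documents containing every query term."""
    terms = set(query_terms)
    hits = []
    for cluster in index:
        # Phase 1: keep only clusters whose token list covers the query.
        if not terms <= set(cluster.tokens):
            continue
        # Phase 2: check each document in the candidate cluster.
        for doc_id, refs in zip(cluster.doc_ids, cluster.doc_token_refs):
            if terms <= {cluster.tokens[i] for i in refs}:
                hits.append(doc_id)
    return sorted(hits)


# Hypothetical sample data: five tokenized documents in two clusters.
docs = {
    0: "the quick brown fox".split(),
    1: "the quick red fox".split(),
    2: "the lazy brown dog".split(),
    3: "stock prices rose today".split(),
    4: "stock prices fell today".split(),
}
clusters = [[0, 1, 2], [3, 4]]
index = build_index(clusters, docs)

print(search(index, ["brown", "fox"]))    # [0]
print(search(index, ["red", "dog"]))      # []
print(search(index, ["stock", "today"]))  # [3, 4]
```

The query for "red" and "dog" shows why the second phase is needed: the first cluster contains both terms, so it passes the cluster-level check, but no single document in it contains both.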
