发明名称 Technique for Information Retrieval Using Enhanced Latent Semantic Analysis
摘要 A technique for information retrieval includes parsing a corpus to identify a number of wordform instances within each document of the corpus. A weighted morpheme-by-document matrix is generated based at least in part on the number of wordform instances within each document of the corpus and based at least in part on a weighting function. The weighted morpheme-by-document matrix separately enumerates instances of stems and affixes. Additionally or alternatively, a term-by-term alignment matrix may be generated based at least in part on the number of wordform instances within each document of the corpus. At least one lower rank approximation matrix is generated by factorizing the weighted morpheme-by-document matrix and/or the term-by-term alignment matrix.
申请公布号 US2010185685(A1) 申请公布日期 2010.07.22
申请号 US20090352621 申请日期 2009.01.13
申请人 CHEW PETER A;BADER BRETT W 发明人 CHEW PETER A.;BADER BRETT W.
分类号 G06F17/30;G06F7/00;G06F17/28 主分类号 G06F17/30
代理机构 代理人
主权项
地址
您可能感兴趣的专利