发明名称 Method and system for determining relevance of terms in text documents
摘要 The present invention provides a corpus-independent method for determining relevancy of terms to content of text appearing in a document by analyzing the document itself. Conventional information extraction, or other methods, may be applied to a document to generate a list of terms. The invention analyzes the document using relevancy scoring algorithms to determine a term relevancy score representing the term's relevance to the text contained in the document. The scores, including an aggregate score, may be normalized in the process. Based on relevancy scoring, terms are then ranked and further processed. In this manner relevancy is determined based on the subject document itself and by analyzing the occurrences and locations of the terms within the document. Additional techniques may be applied to relate the relevancy scores generated by the present invention to a corpus or collection of documents.
申请公布号 US2011004606(A1) 申请公布日期 2011.01.06
申请号 US20090459475 申请日期 2009.07.01
申请人 AUMANN YEHONATAN;KELLER ORGAD;SHLIVINSKI RAN 发明人 AUMANN YEHONATAN;KELLER ORGAD;SHLIVINSKI RAN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址