发明名称 METHOD FOR EXTRACTING SUBJECT AND SORTING DOCUMENT OF SEARCHING ENGINE, COMPUTER READABLE RECORD MEDIUM ON WHICH PROGRAM FOR EXECUTING METHOD IS RECORDED
摘要 A method for extracting subjects and sorting documents in a search engine, and a computer-readable recording medium storing a program thereof are provided to enable a user to access desired information conveniently/quickly by selecting atypical/various subjects not classified in a manual mode, classify the target documents into each subject, and determine whether the searched document is suitable for the subject. A relation degree representing that respective keywords are selected at the same time is measured for the keywords included in target documents. A convergence relation degree between a word set about the predetermined keyword and the word set related to other keywords is measured. The keyword is selected as a subject when the convergence relation degree is higher than a specific value. A naive Bayesian probability is calculated by performing naive Bayesian training for training documents and each keyword included in the target documents. A vector size of each keyword included in the training and target document is calculated. A distance between the vector size of each keyword of the training and target document is calculated. Similarity of each keyword is calculated by multiplying the naive Bayesian probability and the distance. A ranking value is calculated by processing the similarity of each keyword included in the target document.
申请公布号 KR101249183(B1) 申请公布日期 2013.04.03
申请号 KR20060079177 申请日期 2006.08.22
申请人 发明人
分类号 G06F17/40 主分类号 G06F17/40
代理机构 代理人
主权项
地址