Title Information processing apparatus, information processing method, and program
Abstract An information processing apparatus includes a category classifying unit configured to classify a document into one or more categories, a word extracting unit configured to extract one or more words from the document, a word score calculating unit configured to calculate a word score for each of the one or more words extracted from the document on the basis of an appearance frequency of the word in each of the one or more categories, the word score serving as an index of interest of the word, a sentence-for-computation extracting unit configured to extract one or more sentences from the document, and a sentence score calculating unit configured to calculate a sentence score for each of the extracted one or more sentences on the basis of the word score calculated by the word score calculating unit, the sentence score serving as an index of interest of the sentence.
Publication No. US9122680(B2) Publication Date 2015.09.01
Application No. US201012880598 Filing Date 2010.09.13
Applicant Sony Corporation Inventors Isozu Masaaki; Enami Tsugutomo
Classification G06F17/27; G06F17/30 Main Classification G06F17/27
Agency Sherr & Jiang, PLLC Agent Sherr & Jiang, PLLC
Principal Claim 1. An information processing apparatus comprising: a memory; and a processor, which: receives instructions from a user to search for a document; searches for the document in a storage associated with the information processing apparatus; stores the document in the memory; classifies the document into a first category and a second category; extracts a word from the document; determines a first total number of documents in the first category; determines a second total number of documents in the second category; determines a first number of documents as a first count of the documents in the first category that contain the word; determines a second number of the documents as a second count of the documents in the second category that contain the word; determines a first ratio of the first total number of documents and the first number of documents; determines a second ratio of the second total number of documents and the second number of documents; calculates a word score for the word using a mathematical product of the first ratio and the second ratio, wherein the word score is higher when the word appears less frequently in the category; extracts a sentence from the document; calculates a sentence score for the sentence on the basis of the word score; and displays the sentence on a display, when the sentence score exceeds a threshold.
Address Tokyo JP
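
Illustrative sketch (not part of the patent record). The scoring steps recited in claim 1 might be sketched in Python as follows. Several details are assumptions, since the claim does not fix them: each ratio is taken as total documents divided by documents containing the word (so that rarer words score higher, as the claim requires), a +1 is added to the denominator to avoid division by zero, the sentence score is taken as the mean of its word scores, and tokenization and sentence splitting use simple regular expressions. All function names are hypothetical.

```python
import re
from typing import List, Set

# A document is represented here by its set of distinct lowercase words (assumption).
Document = Set[str]


def ratio(total_docs: int, docs_with_word: int) -> float:
    """Ratio of the category's total document count to the count containing the word.

    Assumed direction: total / containing, so a rarer word yields a larger ratio.
    Assumed smoothing: +1 in the denominator avoids division by zero.
    """
    return total_docs / (docs_with_word + 1)


def word_score(word: str, first: List[Document], second: List[Document]) -> float:
    """Product of the first-category ratio and the second-category ratio."""
    first_count = sum(1 for doc in first if word in doc)
    second_count = sum(1 for doc in second if word in doc)
    return ratio(len(first), first_count) * ratio(len(second), second_count)


def sentence_score(sentence: str, first: List[Document], second: List[Document]) -> float:
    """Mean word score of the sentence (the aggregation rule is an assumption)."""
    words = re.findall(r"[a-z']+", sentence.lower())
    if not words:
        return 0.0
    return sum(word_score(w, first, second) for w in words) / len(words)


def interesting_sentences(
    text: str, first: List[Document], second: List[Document], threshold: float
) -> List[str]:
    """Return the sentences whose score exceeds the threshold."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    return [s for s in sentences if sentence_score(s, first, second) > threshold]


if __name__ == "__main__":
    # Toy categories, invented purely for illustration.
    first_category = [{"the", "cat"}, {"the", "dog"}]
    second_category = [{"the", "market"}, {"the", "bank"}]
    text = "The the the. The zebra ran."
    for sentence in interesting_sentences(text, first_category, second_category, 1.0):
        print(sentence)
```

With the toy categories above, a word that occurs in every document of both categories receives a low score, while a word absent from both receives the highest score, so only sentences containing uncommon words pass the threshold. A real system would replace the toy tokenization and the mean aggregation with whatever the full specification prescribes.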