Title of invention: Method for processing text to construct a model of the text
Abstract: Text is processed to construct a model of the text. The text has a shared vocabulary and is partitioned into sets and subsets of texts. The usage of the shared vocabulary differs between two or more of the sets, and the topics differ between two or more of the subsets. A probabilistic model is defined for the text. The model treats each word in the text as a token having a position and a word value. The vocabulary usage, topics, subtopics, and word value of each token are represented by distributions of discrete random variables in the model. Parameters of the model are estimated corresponding to the vocabulary usages, the word values, the topics, and the subtopics associated with the words.
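The token representation and the parameter-estimation step described in the abstract can be illustrated with a minimal sketch. It is not the patented method. It assumes that the vocabulary-usage label of each set and the topic label of each subset are observed, whereas the abstract models them as random variables, and it omits subtopics entirely. The function names, the corpus layout, the example labels, and the additive smoothing constant `alpha` are all hypothetical choices made for illustration.

```python
from collections import Counter, defaultdict


def tokenize(text):
    """Return the words of a text as (position, word_value) tokens."""
    return [(i, w) for i, w in enumerate(text.lower().split())]


def estimate_word_distributions(corpus, alpha=1.0):
    """Estimate a discrete distribution P(word | usage, topic) by smoothed counting.

    corpus: list of (usage_label, topic_label, text) triples (hypothetical layout).
    alpha:  additive smoothing constant (an assumption of this sketch).
    """
    counts = defaultdict(Counter)
    vocab = set()
    for usage, topic, text in corpus:
        for _, word in tokenize(text):
            counts[(usage, topic)][word] += 1
            vocab.add(word)
    vocab = sorted(vocab)  # the shared vocabulary across all sets

    dists = {}
    for key, c in counts.items():
        total = sum(c.values()) + alpha * len(vocab)
        dists[key] = {w: (c[w] + alpha) / total for w in vocab}
    return vocab, dists


# Two sets with different vocabulary usage, each split into two topic subsets.
corpus = [
    ("uk", "transport", "the lorry is on the motorway"),
    ("us", "transport", "the truck is on the highway"),
    ("uk", "food", "chips and biscuits"),
    ("us", "food", "fries and cookies"),
]
vocab, dists = estimate_word_distributions(corpus)
```

Here `dists[("uk", "transport")]` and `dists[("us", "transport")]` are defined over the same shared vocabulary but put different probability on words such as "lorry" and "truck", which is the sense in which two sets can share a vocabulary while using it differently.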
Publication number: JP5866018(B2)  Publication date: 2016.02.17
Application number: JP20140530845  Filing date: 2013.02.26
Applicant: Mitsubishi Electric Corporation  Inventors: Hershey, John R.; Le Roux, Jonathan; Heakulani, Creighton K.
Classification: G06F17/30  Main classification: G06F17/30
Agency:  Agent:
Main claim:
Address: