发明名称 METADATA EXTRACTION SERVER, METADATA EXTRACTION METHOD AND PROGRAM
摘要 <P>PROBLEM TO BE SOLVED: To automatically generate and update a proper recognizer from information on a Web. Ž<P>SOLUTION: A teacher information collection part 234 collects content from a Web server 240. A characteristic amount calculation part 204 calculates a characteristic amount matrix representing the content of the collected content. A characteristic amount DB 201 clusters the characteristic amount matrixes by group when storing them at predetermined time intervals, and determines a subspace for each group of the characteristic amount matrixes. A metadata arrangement part 203 morphologically analyzes the collected content, and generates a metadata matrix representing occurrence frequency of a morphologically-analyzed word. A correlation coefficient calculation part 214 calculates a correlation coefficient between the group of the characteristic amount matrixes and a group of the metadata matrixes in each subspace. A correlation coefficient DB 205 stores the correlation coefficient calculated for each subspace. Ž<P>COPYRIGHT: (C)2010,JPO&INPIT Ž
申请公布号 JP2010198111(A) 申请公布日期 2010.09.09
申请号 JP20090039529 申请日期 2009.02.23
申请人 NIPPON TELEGR & TELEPH CORP 发明人 KONDO SATORU;OGAWA TAKESHI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址