发明名称 |
METADATA EXTRACTION SERVER, METADATA EXTRACTION METHOD AND PROGRAM |
摘要 |
<P>PROBLEM TO BE SOLVED: To automatically generate and update a proper recognizer from information on a Web. Ž<P>SOLUTION: A teacher information collection part 234 collects content from a Web server 240. A characteristic amount calculation part 204 calculates a characteristic amount matrix representing the content of the collected content. A characteristic amount DB 201 clusters the characteristic amount matrixes by group when storing them at predetermined time intervals, and determines a subspace for each group of the characteristic amount matrixes. A metadata arrangement part 203 morphologically analyzes the collected content, and generates a metadata matrix representing occurrence frequency of a morphologically-analyzed word. A correlation coefficient calculation part 214 calculates a correlation coefficient between the group of the characteristic amount matrixes and a group of the metadata matrixes in each subspace. A correlation coefficient DB 205 stores the correlation coefficient calculated for each subspace. Ž<P>COPYRIGHT: (C)2010,JPO&INPIT Ž
|
申请公布号 |
JP2010198111(A) |
申请公布日期 |
2010.09.09 |
申请号 |
JP20090039529 |
申请日期 |
2009.02.23 |
申请人 |
NIPPON TELEGR & TELEPH CORP |
发明人 |
KONDO SATORU;OGAWA TAKESHI |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|