发明名称 DEVICE AND METHOD FOR SELECTING INSTANCES IN EXPANDED SET CONTAINING GIVEN SEED STRING
摘要 <p>A seed string is inputted into an input unit (101). A search unit (102) acquires document snippets containing said seed string. A segment-acquisition unit (103) acquires segments by segmenting said snippets at a segment-delimiter string. A segment-element acquisition unit (104) acquires segment elements by segmenting the segments at a segment-element delimiter string. A segment-score calculation unit (105) uses the standard deviation of the lengths of the segment elements to calculate a score for each segment. A segment-element-score calculation unit (106) uses the segment scores and distances between the positions of the seed string and the positions of the segment elements to calculate a score for each segment element. On the basis of said segment-element scores, a selection unit (107) selects one of the segment elements as a candidate instance in an expanded set for the seed string.</p>
申请公布号 CA2801298(C) 申请公布日期 2014.11.25
申请号 CA20122801298 申请日期 2012.02.22
申请人 RAKUTEN, INC. 发明人 HAGIWARA, MASATO
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址